OpenCitations data downloads
Access all data dumps from OpenCitations Meta and OpenCitations Index
Data available through Zenodo, Figshare and the Internet Archive
Bibliographic metadata for all publications in the OpenCitations Index
Latest: January 2026 dump
Includes data on 129M+ bibliographic entities, 389M+ authors, and 1.3M+ publication venues
View Meta downloadsOMID-to-OMID references representing all citations from multiple sources
Latest: July 2025 dump
Includes data on 2.2 billion+ citations from various sources
View Index downloadsOpenCitations Meta
The OpenCitations Meta database stores and delivers bibliographic metadata for all publications involved in the OpenCitations Index.
Most recent OpenCitations Meta data dump - January 2026
Released on 2026-01-20, compared to the previous version, includes metadata related to citing and cited bibliographic resources added in the September 2025 version of Crossref, as well as the DataCite Public Data File 2025.
Key statistics
129M+
Bibliographic entities
389M+
Authors
1.3M+
Publication venues
Download files
| Type and format | Archive | Size |
|---|---|---|
| Metadata (CSV) | Download tar.gz | 13GB (51GB uncompressed) |
| Metadata database (OpenLink Virtuoso) | Download 7z | 42GB (190GB uncompressed) |
| Metadata and provenance (RDF) | Download 7z | 48GB (69GB uncompressed) |
| Provenance database (OpenLink Virtuoso) | Download 7z | 144GB (1TB uncompressed) |
Previous dumps
Compared to the previous version, includes metadata from the April 2025 version of Crossref and the December 2024 dump of JaLC.
Available in CSV (metadata), Kubernetes-ready database, RDF (metadata and provenance), and RDF (provenance database).
The metadata CSV is available on Figshare at the following link: https://doi.org/10.6084/m9.figshare.21747461.v11
Metadata and Provenance is available on Figshare at the following link: https://doi.org/10.6084/m9.figshare.21747536.v8
Compared to the previous dump, this one adds the metadata contained in the Crossref dump dated March 2024.
Available in CSV (metadata) and JSON-LD (metadata and provenance) formats.
Compared to the previous dump, this one incorporates OpenAlex IDs, leveraging data from the OpenAlex dump.
Available in CSV (metadata) and RDF (metadata and provenance) formats.
Compared to the previous dump, this one adds the metadata contained in the Japan Link Center (JaLC).
Available in CSV (metadata) format.
-
2023-10-24: Adds the metadata from OpenAIRE and the Crossref dump dated September 2023.
Available in CSV (metadata) and JSON-LD (metadata and provenance) formats.
-
2023-06-28: Adds the metadata from the dump of NIH Open Citation Collection dated November 2022.
Available in CSV (metadata) and JSON-LD (metadata and provenance) formats.
-
2023-02-24: Adds the metadata from the last dump of DataCite dated 22 October 2021.
Available in CSV (metadata) and JSON-LD (metadata and provenance) formats.
-
2022-12-20: Based on open references to works with DOIs within the Crossref dump dated December 2022.
Available in CSV (metadata) and JSON-LD (metadata and provenance) formats.
OpenCitations Index
The OpenCitations Index stores OMID-to-OMID references representing all the references gathered from several sources.
Most recent OpenCitations Index data dump - February 2026
Released on 2026-02-17, this dataset adds the citation data contained in the Crossref dump dated September 2025, as well as the DataCite Public Data File of April 2024.
Key statistics
2.4 Billion+
Citations (2,422,432,262)
Download files
| Type and format | Archive | Size |
|---|---|---|
| Citation data (CSV) | Download ZIP | TBA |
| Citation data (N-Triple) | Download ZIP | TBA |
| Citation data (Scholix) | Download ZIP | TBA |
| Provenance data (CSV) | Download ZIP | 24GB (542GB uncompressed) |
| Provenance data (N-Triple) | Download ZIP | 144GB (4.5 TB uncompressed) |
Additional files
| Type and format | Archive | Size |
|---|---|---|
| Citation data sources' info (N-Triple): information regarding the data source collection (e.g., COCI, DOCI, POCI, etc) of all the citation data | Download ZIP | 29GB (480GB uncompressed) |
Previous dumps
Available in CSV (citation data), N-Triple (citation data), SCHOLIX (citation data), CSV (provenance data), and N-Triple (provenance data).
In addition, a N-Triple and CSV dump, containing information regarding the data source collection.
Available in CSV (citation data), N-Triple (citation data), SCHOLIX (citation data), CSV (provenance data), and N-Triple (provenance data).
In addition, a N-Triple and CSV dump, containing information regarding the data source collection.
Available in CSV (citation data), N-Triple (citation data), SCHOLIX (citation data), CSV (provenance data), and N-Triple (provenance data).
In addition, a N-Triple dump containing information regarding the data source collection, and a citation count dump with the number of incoming citations to each bibliographic entity (identified by an OMID).
Available in CSV (citation data), N-Triple (citation data), SCHOLIX (citation data), CSV (provenance data), and N-Triple (provenance data).
In addition, a N-Triple dump containing information regarding the data source collection, and a citation count dump with the number of incoming citations to each bibliographic entity (identified by an OMID).
Available in CSV (citation data), N-Triple (citation data), SCHOLIX (citation data), CSV (provenance data), and N-Triple (provenance data).
In addition, a N-Triple dump containing information regarding the data source collection.