OpenCitations data downloads

Access all data dumps from OpenCitations Meta and OpenCitations Index

Data available through Zenodo, Figshare and the Internet Archive

Meta

Bibliographic metadata for all publications in the OpenCitations Index

Latest: January 2026 dump

Includes data on 129M+ bibliographic entities, 389M+ authors, and 1.3M+ publication venues

Index

OMID-to-OMID references representing all citations from multiple sources

Latest: February 2026 dump

Includes data on 2.4 billion+ citations from various sources

OpenCitations Meta

The OpenCitations Meta database stores and delivers bibliographic metadata for all publications involved in the OpenCitations Index.

Most recent OpenCitations Meta data dump - January 2026

Released on 2026-01-20. Compared to the previous version, this dump adds metadata for the citing and cited bibliographic resources introduced in the September 2025 version of Crossref and in the DataCite Public Data File 2025.

Key statistics

129M+

Bibliographic entities

389M+

Authors

1.3M+

Publication venues

2,862,406 editors and 106,791,171 publishers (counted by roles, without disambiguating individuals)

Download files

Type and format | Archive | Size
Metadata (CSV) | tar.gz | 13GB (51GB uncompressed)
Metadata database (OpenLink Virtuoso) | 7z | 42GB (190GB uncompressed)
Metadata and provenance (RDF) | 7z | 48GB (69GB uncompressed)
Provenance database (OpenLink Virtuoso) | 7z | 144GB (1TB uncompressed)

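
Because the metadata CSV archive is large once uncompressed, it can be convenient to read rows straight from the tar.gz file instead of extracting it first. The following is a minimal sketch, not an official OpenCitations tool. It assumes the archive contains one or more `.csv` members, each starting with a header row; the function name `iter_meta_rows` and the archive path passed to it are hypothetical.

```python
import csv
import io
import tarfile
from typing import Dict, Iterator


def iter_meta_rows(archive_path: str) -> Iterator[Dict[str, str]]:
    """Yield one dict per CSV row from every .csv member of a tar.gz archive."""
    # "r:gz" opens a gzip-compressed tar file; members are visited in order.
    with tarfile.open(archive_path, mode="r:gz") as archive:
        for member in archive:
            # Skip directories and anything that is not a CSV file.
            if not (member.isfile() and member.name.endswith(".csv")):
                continue
            raw = archive.extractfile(member)
            if raw is None:
                continue
            # Decode bytes lazily so a member is never loaded whole into memory.
            text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
            # Column names come from each file's own header row.
            yield from csv.DictReader(text)
```

Iterating with `for row in iter_meta_rows(path)` processes one row at a time, so memory use stays roughly constant regardless of the archive size. Very long fields may require raising the limit with `csv.field_size_limit`.
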
Previous dumps

The metadata CSV is available on Figshare at the following link: https://doi.org/10.6084/m9.figshare.21747461.v11

The metadata and provenance data are available on Figshare at the following link: https://doi.org/10.6084/m9.figshare.21747536.v8

Compared to the previous dump, this one adds the metadata contained in the Crossref dump dated March 2024.

Available in CSV (metadata) and JSON-LD (metadata and provenance) formats.

Compared to the previous dump, this one incorporates OpenAlex IDs, leveraging data from the OpenAlex dump.

Available in CSV (metadata) and RDF (metadata and provenance) formats.

Compared to the previous dump, this one adds the metadata from the Japan Link Center (JaLC).

Available in CSV (metadata) format.

OpenCitations Index

The OpenCitations Index stores OMID-to-OMID citation links covering all the references gathered from several sources.

Most recent OpenCitations Index data dump - February 2026

Released on 2026-02-17, this dataset adds the citation data contained in the Crossref dump dated September 2025, as well as the DataCite Public Data File of April 2024.

Key statistics

2.4 Billion+

Citations (2,422,432,262)

Download files

Type and format | Archive | Size
Citation data (CSV) | ZIP | TBA
Citation data (N-Triple) | ZIP | TBA
Citation data (Scholix) | ZIP | TBA
Provenance data (CSV) | ZIP | 24GB (542GB uncompressed)
Provenance data (N-Triple) | ZIP | 144GB (4.5TB uncompressed)

Additional files

Type and format | Archive | Size
Citation data sources' info (N-Triple): information regarding the data source collection (e.g., COCI, DOCI, POCI) of all the citation data | ZIP | 29GB (480GB uncompressed)

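
Several of the archives listed above expand to hundreds of gigabytes or more, so it may be worth checking free disk space before downloading and extracting. The sketch below parses size labels written like those in the tables (for example `24GB` or `4.5 TB`) and compares the total against the free space reported by the operating system. It assumes decimal units (1GB = 10^9 bytes), which the page does not specify; the helper names `parse_size` and `has_room` are hypothetical.

```python
import re
import shutil

# Assumption: decimal units, since the listing does not say GB vs GiB.
_UNITS = {"GB": 10**9, "TB": 10**12}


def parse_size(text: str) -> int:
    """Convert a size label such as '13GB' or '4.5 TB' into a byte count."""
    match = re.fullmatch(r"\s*(\d+(?:\.\d+)?)\s*(GB|TB)\s*", text)
    if match is None:
        # Labels such as 'TBA' carry no size information.
        raise ValueError(f"unrecognised size label: {text!r}")
    value, unit = match.groups()
    return int(float(value) * _UNITS[unit])


def has_room(target_dir: str, archive: str, uncompressed: str) -> bool:
    """Return True if target_dir can hold the archive plus its extracted contents."""
    needed = parse_size(archive) + parse_size(uncompressed)
    return shutil.disk_usage(target_dir).free >= needed
```

For instance, `has_room(".", "24GB", "542GB")` checks whether the current directory could hold the provenance CSV archive together with its extracted contents.
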
Previous dumps

Available in CSV (citation data), N-Triple (citation data), Scholix (citation data), CSV (provenance data), and N-Triple (provenance data).

In addition, an N-Triple and CSV dump containing information regarding the data source collection is available.

Available in CSV (citation data), N-Triple (citation data), Scholix (citation data), CSV (provenance data), and N-Triple (provenance data).

In addition, an N-Triple and CSV dump containing information regarding the data source collection is available.

Available in CSV (citation data), N-Triple (citation data), Scholix (citation data), CSV (provenance data), and N-Triple (provenance data).

In addition, this dump includes an N-Triple dump containing information regarding the data source collection, and a citation count dump with the number of incoming citations to each bibliographic entity (identified by an OMID).

Available in CSV (citation data), N-Triple (citation data), Scholix (citation data), CSV (provenance data), and N-Triple (provenance data).

In addition, this dump includes an N-Triple dump containing information regarding the data source collection, and a citation count dump with the number of incoming citations to each bibliographic entity (identified by an OMID).

Available in CSV (citation data), N-Triple (citation data), Scholix (citation data), CSV (provenance data), and N-Triple (provenance data).

In addition, an N-Triple dump containing information regarding the data source collection is available.
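
Some of the dumps above include a citation count file giving the number of incoming citations per OMID. As a rough illustration of what such a count represents, the sketch below tallies incoming citations from (citing, cited) pairs. It does not reflect how OpenCitations produces its files; the function name is hypothetical and the OMID values in the sample are made-up placeholders.

```python
from collections import Counter
from typing import Iterable, Tuple


def incoming_citation_counts(citations: Iterable[Tuple[str, str]]) -> Counter:
    """Count how often each entity appears as the cited side of a citation."""
    counts: Counter = Counter()
    for _citing, cited in citations:
        counts[cited] += 1
    return counts


# Placeholder (citing, cited) pairs; these identifiers are invented for illustration.
sample = [
    ("omid:br/0601", "omid:br/0603"),
    ("omid:br/0602", "omid:br/0603"),
    ("omid:br/0603", "omid:br/0604"),
]
counts = incoming_citation_counts(sample)
```

Because the function accepts any iterable, the pairs can be fed from a streaming reader, so only the counts need to fit in memory.
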