Change Log

v10.2.1 (May 8, 2025)
Added
0.63% increase in the number of research products (+1.9Mi)
Added
0.29% increase (+324K) in the number of research products with affiliation information
Added
6.5% increase (+288K) in the number of funded research products
Added
Fixed the use of the titles blacklist in the deduplication of research products. In the previous version the blacklist was not applied during deduplication, which led to a higher number of duplicates; it is now applied to identify and remove duplicates more effectively. The impact of this change is limited, because the titles blacklist is relatively small (currently between 200 and 300 records).
Changed
Updated Crossref publications to include contents until March 2025
Changed
Updated ORCID contents until March 2025
Changed
Updated Datacite contents until March 2025
Changed
Updated PubMed contents until March 2025
v10.2.0 (Apr 3, 2025)
Added
Added an extended set of inferred RAIDs consisting of ~30K records, which group publications, projects and organizations under the same research activity.
Added
Introduced a blacklisting mechanism to remove research products that are withdrawn from Zenodo. The blacklist currently contains 1.2Mi DOIs that, starting from this release, will be systematically removed from the Graph. As these records might have relations with other entities, those relations are removed as well. With this update we observe a decrease of ~5% (254K) in the number of research products related to a project, the majority of them being related to the withdrawn DOIs.
Changed
Updated Crossref publications to include contents until February 2025
Changed
Updated ORCID contents until February 2025
Changed
Updated Datacite contents until February 2025
Changed
Updated PubMed contents until February 2025
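As an illustration of the withdrawn-DOI blacklisting introduced in this release, the removal can be pictured as a two-step filter: drop the products whose DOI is blacklisted, then drop every relation that touches a dropped product. The following is a minimal sketch under assumed, simplified record shapes; the `id`, `doi`, `source` and `target` keys and the function name `apply_blacklist` are hypothetical and do not describe the actual Graph pipeline.

```python
def apply_blacklist(products, relations, blacklisted_dois):
    """Drop blacklisted products and every relation that points at them.

    Hypothetical record shapes: products are dicts with 'id' and 'doi',
    relations are dicts with 'source' and 'target' identifiers.
    """
    # DOIs are case-insensitive, so compare them in lowercase.
    blacklist = {doi.lower() for doi in blacklisted_dois}
    kept = [p for p in products if (p.get("doi") or "").lower() not in blacklist]
    kept_ids = {p["id"] for p in kept}
    removed_ids = {p["id"] for p in products} - kept_ids
    # A relation survives only if neither of its ends was removed.
    kept_relations = [
        r for r in relations
        if r["source"] not in removed_ids and r["target"] not in removed_ids
    ]
    return kept, kept_relations
```

Removing the relations together with the products is what explains the drop in project-related products reported above: a product that disappears takes its project links with it.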
v10.1.0 (Mar 6, 2025)
Added
Added person entity, created out of the ORCID works with a DOI, and linked to the research products, the organizations, and the projects
Added
Introduced ~1K Research Activity Identifier records, inferred out of the graph contents. These records represent research activities that materialise in the graph as groups of scientific results on the same topics
Added
Introduced affiliation relationships stated by end users on the OpenAIRE Explore portal
Changed
Multimedia types (Film, Image, Sound, Audiovisual) moved from Dataset to OtherResearchProduct
Changed
Updated mapping applied to the contents from Microsoft Academic Graph: Journal records are now typed as Article
Changed
Updated Crossref publications to include contents until January 2025
Changed
Updated ORCID contents until January 2025
Changed
Updated Datacite contents until January 2025
Changed
Updated PubMed contents until January 2025
v10.0.1 (Jan 27, 2025)
Added
Incremental update adding newer records from Crossref and Datacite, resulting in ~13K more datasets (0.02%) and ~950K more publications (0.5%).
Changed
Updated Crossref publications to include contents until December 2024
Changed
Updated Datacite contents until December 2024
v10.0.0 (Dec 22, 2024)
Added
Updated the public datasets on Zenodo with the latest version of the graph. This dataset conforms to an updated version of the schema, available at 10.5281/zenodo.14608526.
Added
The complete OpenAIRE dataset is available at 10.5281/zenodo.14582029, while the other public depositions are being updated and will be available soon.
Added
General increase of the scientific products linked to funding across all the funders: +0.7% (+32K). In particular, the products funded by IReL increased by 17% (+1414) and the products funded by the NIH increased by 1.6% (+15K)
Added
General increase of the publications with affiliation information +0.57% (+570K)
Changed
Updated Crossref publications to include contents until November 2024
Changed
Updated ORCID contents until November 2024
Changed
Updated Datacite contents until November 2024
v9.0.1 (Dec 3, 2024)
Added
~12% increase (+68K) in the number of research products funded by UK Research and Innovation (UKRI)
Added
~16% increase (+131K) in the number of research products funded by the National Institutes of Health (NIH)
Added
~11% increase (+20K) in the number of research products funded by Wellcome Trust (WT)
Added
General increase of the scientific products with ORCID identified authors +2.18% since October 2024 (+108K)
Changed
Updated Crossref publications to include contents until October 2024
Changed
Updated ORCID contents until October 2024
Changed
Updated Datacite contents until October 2024
Changed
Some repositories have been removed following the recent stricter application of the acquisition policies (data sources that do not match the OpenAIRE Guidelines), causing a reduction in the number of pre-prints.
Changed
The deduplication process has been further refined, so that different versions of the same pre-print are now merged.
v9.0.0 (Oct 22, 2024)
Added
~2.5% increase (+6.7Mi) in the number of research products
Added
~6.35% increase (+11.9Mi) in the number of affiliations
Added
~7.3% increase (+311K) in the number of funded research products
Added
Import of SDG classifications without a DOI
Added
Introduced plugins for collecting research results from the OSF preprints server and the UKRI registry
Changed
Updated Crossref publications to include contents until Aug 2024. Updated the mapping so that records with an "is-review-of" relationship are mapped as publications of type "Review". Forced the hostedby of Crossref records with DOI prefixes 10.3410 and 10.12703 to the H1 Connect data source.
Changed
Updated ORCID and Datacite contents until Sept 2024
Changed
Improvements in the comparators used in the organization deduplication.
Changed
Changed the selection criteria for the pivot record of a group so that the best PID type becomes the first criterion; as a consequence, pivots will converge to records having a DOI.
Changed
Community tags added to all the entity types.
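The two Crossref mapping rules introduced in this release (typing "is-review-of" records as "Review" and forcing the hostedby for the DOI prefixes 10.3410 and 10.12703) can be sketched as follows. This is only an illustration: the record shape (`doi`, `relations`, `type`, `hostedby` keys) and the function name `map_crossref_record` are assumptions, not the actual mapping code.

```python
# The two DOI prefixes named in the release note.
H1_CONNECT_PREFIXES = ("10.3410/", "10.12703/")


def map_crossref_record(record):
    """Return a copy of the record with the two mapping rules applied.

    Hypothetical record shape: 'doi' (str), 'relations' (list of relation
    names), 'type' (str) and 'hostedby' (str).
    """
    mapped = dict(record)
    # Rule 1: a record that reviews another work is typed as "Review".
    if "is-review-of" in record.get("relations", []):
        mapped["type"] = "Review"
    # Rule 2: DOIs under the two prefixes are hosted by H1 Connect.
    if record.get("doi", "").lower().startswith(H1_CONNECT_PREFIXES):
        mapped["hostedby"] = "H1 Connect"
    return mapped
```

The trailing slash in each prefix is deliberate: it prevents a longer registrant code that merely begins with the same digits from matching.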
v8.0.1 (Aug 8, 2024)
Added
Introduced mapping of affiliations from publisher websites
Changed
Updated Crossref publications to include contents until June 2024
Changed
Updated ORCID contents until July 2024
Changed
Updated Datacite contents until July 2024
Changed
Include only FoS L1..L2 in the record serialization
v8.0.0 (Jul 14, 2024)
Added
General increase of the scientific products with ORCID identified authors +0.43% (+145K)
Changed
Improved matching of organizations in the deduplication algorithm, leading to fewer false positives
Changed
Updated Crossref publications to include contents until May 2024
Changed
Updated ORCID contents until June 2024
Changed
Updated Datacite contents until June 2024
Changed
[Updated serialization of the data model]: The serialization of the property names is changed to camelCase
Changed
[Updated serialization of the data model]: citationCount, influence, popularity, impulse, all of them typed as Double
Changed
[Updated serialization of the data model]: citationClass, influenceClass, impulseClass, popularityClass, all of them typed as String
Changed
[Updated serialization of the data model]: The element datasettype was renamed to type
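For consumers holding records in the previous serialization, part of the changes listed in this release can be applied with a small adapter. The sketch below covers only the `datasettype` renaming and the typing of the indicator fields; it assumes, purely for illustration, that these fields sit at the top level of a flat dictionary, which is not necessarily how they are nested in the actual data model. The function name `normalise_record` is hypothetical.

```python
# Field names taken from the release note.
DOUBLE_FIELDS = ("citationCount", "influence", "popularity", "impulse")
STRING_FIELDS = ("citationClass", "influenceClass", "impulseClass", "popularityClass")


def normalise_record(record):
    """Return a copy of a flat record following the updated serialization."""
    out = dict(record)
    # The element datasettype was renamed to type.
    if "datasettype" in out:
        out["type"] = out.pop("datasettype")
    # Indicator values are typed as Double (float in Python).
    for name in DOUBLE_FIELDS:
        if out.get(name) is not None:
            out[name] = float(out[name])
    # Indicator classes are typed as String.
    for name in STRING_FIELDS:
        if out.get(name) is not None:
            out[name] = str(out[name])
    return out
```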
v7.2.0 (Jun 19, 2024)
Added
Introduced new Field of Science classifications for publications, reaching a total of ~77.2Mi publications classified
Added
General increase of the affiliations +20% (from 162Mi to 195Mi)
Added
General increase of the scientific products with ORCID identified authors +10% (from 3.09Mi to 3.39Mi)
Changed
Revised deduplication configuration to better exploit resource types
Changed
The DOIBoost dataset was superseded by the direct aggregation of its data sources: Crossref, Unpaywall, Microsoft Academic Graph, ORCID. See the section on the aggregation of non-compatible sources for more details
Changed
Relaxed Crossref publication inclusion criteria, now accepting records without author information, leading to a +15% increase (from 127Mi to 146Mi records). Included contents until April 2024
Changed
Updated ORCID contents until April 2024
Changed
Updated Datacite contents until April 2024
v7.1.3 (Apr 22, 2024)
Added
Introduced new Field of Science classifications, reaching a total of ~73Mi publications classified
Added
General increase of the funded scientific outputs, thanks to full-text mining of new Open Access publications. Some examples:
Added
European Commission - EC +7% (from 1.52Mi to 1.62Mi)
Added
Irish Research Council - IRC +7% (from 12.7K to 13.5K)
Added
French National Research Agency - ANR +5.8% (from 91.5K to 96.8K)
Added
National Institutes of Health - NIH +5% (from 594K to 626K)
Added
UK Research and Innovation - UKRI +3.7% (from 434K to 450K)
Added
General increase of the scientific products with author affiliation information +2% (from 83.12Mi to 84.88Mi)
Changed
Updated Crossref publications to include contents until March 2024
Changed
Updated Datacite contents until March 2024
Changed
Updated ORCID contents until March 2024
v7.1.2 (Mar 27, 2024)
Added
General increase of the funded scientific outputs, thanks to full-text mining of new Open Access publications
Changed
Updated Crossref publications to include contents until February 2024
Changed
Updated Datacite contents until February 2024
Changed
Updated ORCID contents until February 2024
v7.1.1 (Mar 6, 2024)
Added
Updated the content import criteria applied to Datacite, resulting in +13Mi Other Research Products (+167%)
Added
Introduced project PIDs; DOIs are currently available for grants funded by FCT and TWCF
Changed
Scientific products typed as "Collection" categorized under "Research Data" instead of "Other Research Product".
Changed
Updated Crossref publications to include contents until January 2024
Changed
Updated Datacite contents until January 2024
v7.1.0 (Mar 20, 2024)
Added
The scientific products aggregated increased by ~5Mi records (+1.6%)
Changed
A refined version of the deduplication strategy made it possible to catch more duplicates among the scientific products, decreasing their total number by ~3.2Mi (-1.35%).
Changed
Updated Crossref publications to include contents until November 2023
Changed
Updated Datacite contents until December 2023
v7.0.0 (Jan 6, 2024)
Added
Scientific products increased by ~3Mi records (+1.26%)
Added
The number of relations increased by 28.6Mi (+1%)
Added
Funded contents increased by 5%, from 3.6Mi to 3.8Mi. Funders that recorded the highest increase include, for example, EC with +120K linked research products, and SFI with +1K products.
Changed
[New field]: ResearchProduct.isGreen (true, false): indicates whether or not the research product was published following the green open access model
Changed
[New field]: ResearchProduct.openAccessColor (bronze, gold, hybrid): indicates the specific open access model used for the publication
Changed
[New field]: ResearchProduct.isInDiamondJournal (true, false): indicates whether or not the research product was published in a diamond journal
Changed
[New field]: ResearchProduct.publicly-funded (true, false): indicates whether or not the grants acknowledged by the publication come from public funds
v6.2.2 (Nov 23, 2023)
Added
Imported the OpenCitations POCI dataset, containing citations among publications in PubMed
Added
Imported Affiliations from Crossref and from PubMed
Added
Imported Software Heritage identifiers for Software records
Added
Extended coverage of Irish funders imported from Crossref
Added
Peer-reviewed material is now identified with a revised heuristic that improves the coverage
Added
Project references identified by TDM increased by ~10%
Added
Introduced new Field of Science classifications for ~40Mi publications
Changed
Updated Crossref publications to include contents until October 2023
Changed
Updated Datacite contents until October 2023
Changed
Indicators regarding data source downloads and views are taken from the usage counts of September 2023
v6.1.1 (Oct 15, 2023)
Added
Affiliation (result to organization) relations from Crossref
Added
Links to the full text of research products
Added
Cleaning of author and publisher names: tabs, carriage returns and newlines are removed, and double quotes are escaped
Changed
Projects without a grant code are removed
Changed
Crossref dump from July 2023
Changed
ORCID works without a DOI from March 2023
Changed
Usage counts from July 2023
Changed
Datacite contents from early July 2023
Changed
OpenCitations relations from December 2022
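The cleaning of author and publisher names introduced in this release can be sketched as a small string normalisation step. This is a minimal illustration, not the actual implementation: the function name `clean_name` is hypothetical, and replacing the control characters with a single space (rather than deleting them outright) is an assumption made here so that adjacent words do not fuse together.

```python
import re


def clean_name(name):
    """Strip tabs/CR/LF from a name and escape its double quotes."""
    # Assumption: each run of tabs, carriage returns and newlines
    # becomes one space, so that neighbouring words stay separated.
    cleaned = re.sub(r"[\t\r\n]+", " ", name)
    # Collapse any repeated spaces left behind and trim both ends.
    cleaned = re.sub(r" {2,}", " ", cleaned).strip()
    # Escape double quotes with a backslash.
    return cleaned.replace('"', '\\"')
```

Escaping the quotes last keeps the backslashes out of the whitespace handling, so the two steps cannot interfere with each other.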
v6.0.0 (Aug 15, 2023)
Changed
Relationship data model: flattened properties source, sourceType, target, targetType
Changed
BIP! indicators are now serialised as an array
Changed
Crossref dump from June 2023
Changed
ORCID works without a DOI from June 2023
Changed
Usage counts from June 2023
Changed
Datacite contents from June 2023
Changed
OpenCitations relations from January 2023
Changed
BIP! indicators from June 2023
Changed
New Datasources/Services were added, collected from an updated EOSC Service catalogue endpoint
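The flattening of the relationship properties in this release can be pictured with the sketch below. It assumes that the earlier serialization nested each end of a relation as an object with `id` and `type` keys; that previous shape is an assumption made for illustration, as are the sample values and the function name `flatten_relation`. Only the four flattened property names (source, sourceType, target, targetType) come from the release note.

```python
def flatten_relation(rel):
    """Turn nested source/target objects into four flat properties."""
    # Keep every property other than the two nested ends unchanged.
    flat = {k: v for k, v in rel.items() if k not in ("source", "target")}
    for end in ("source", "target"):
        node = rel[end]  # assumed shape: {"id": ..., "type": ...}
        flat[end] = node["id"]
        flat[end + "Type"] = node["type"]
    return flat
```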
v5.2.0 (Jul 16, 2023)
Added
Citations imported from Crossref & MAG
Added
FoS and SDG classifications introduced for ~16Mi research products
Changed
Removed the numerical prefix from the OpenAIRE identifiers
Changed
Dataset file names in the Zenodo depositions changed from dump to dataset
Changed
Crossref dump from May 2023
Changed
ORCID works without a DOI from June 2023
Changed
Usage counts from April 2023
Changed
Datacite contents from June 2023
Changed
OpenCitations relations from January 2023
Changed
Deduplication of the datasource
Changed
Avoid duplicated organisation PIDs
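Consumers that stored identifiers from earlier releases may need to align them with the prefix-free form introduced in this release. The sketch below assumes that the old identifiers started with a run of digits followed by a pipe character (for example `50|...`); that shape, the made-up sample identifier in the comment and the function name `strip_numeric_prefix` are assumptions for illustration only.

```python
import re

# Assumed old shape: one or more digits, then "|", then the identifier.
_NUMERIC_PREFIX = re.compile(r"^\d+\|")


def strip_numeric_prefix(openaire_id):
    """Remove a leading '<digits>|' prefix, if present."""
    # e.g. "50|example_____::0123abcd" (made-up id) loses its "50|" part.
    return _NUMERIC_PREFIX.sub("", openaire_id, count=1)
```

Identifiers that already lack the prefix pass through unchanged, so the function can be applied safely to a mix of old and new identifiers.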
v5.1.3 (Jun 12, 2023)
Added
Datasource and project level usage counts
Changed
Crossref dump from April 2023
Changed
ORCID works without a DOI from May 2023
Changed
Usage counts from April 2023
Changed
Datacite contents from May 2023
Changed
OpenCitations relations from January 2023
Changed
Deduplication of the datasource
v5.1.2 (Apr 3, 2023)
Changed
Crossref dump from February 2023
Changed
ORCID works without a DOI from March 2023
Changed
Usage counts from February 2023 (+76% Downloads per Datasource for 2023)
Changed
Datacite contents from mid March 2023
Changed
OpenCitations relations from January 2023
v5.1.1 (Mar 1, 2023)
Added
Revised SDG classification: improved coverage (+600K classified DOIs)
Added
General increase of the funded scientific outputs, thanks to full-text mining of new Open Access publications
Added
Integrated contents from: EMBL-EBI's Protein Data Bank in Europe and UniProtKB/Swiss-Prot
Changed
Crossref dump from January 2023
Changed
ORCID works without a DOI from January 2023
Changed
Usage counts from January 2023
Changed
Datacite contents from mid February 2023
Changed
OpenCitations relations from December 2022
v5.1.0 (Jan 30, 2023)
Added
Revised SDG classification: better accuracy, lower coverage (will improve in the next months)
Changed
Crossref dump from December 2022
Changed
ORCID works without a DOI from January 2023
Changed
Usage counts from December 2022
Changed
DataCite contents from January 2023
v5.0.0 (Dec 28, 2022)
Added
Impact & Usage indicators at the level of the Result
Added
Beginner's kit in the Downloads section
Added
New relationship types were introduced
Changed
FOS and SDGs were removed from the result subjects
Changed
Measures were removed from the result instance
Changed
Updated DOIBoost to include publications from Crossref and the works from ORCID with a DOI until November 2022
Changed
Added ORCID works without a DOI from November 2022
OpenAIRE Service Catalogue receives funding from the European Union’s Horizon 2020 Research and Innovation programme under OpenAIRE-Advance (No. 777541) and OpenAIRE Nexus (No. 101017452)