Resources

Presentations

‘The Collaborative Metadata Enrichment Taskforce (COMET): Uniting Stakeholders for Collaborative Metadata Enrichment’ presented by John Chodacki at the CNI Spring 2025 Membership Meeting April 7-8, 2025.

On March 5, 2025, the Collaborative Metadata Enrichment Taskforce (COMET) conveners held a wider community webinar to introduce the work of COMET and it's Community Call to Action.

Case Studies

Project Briefs

Resource Type Classification of DOIs

We're exploring an automated system to improve how research outputs are categorized, piloting the approach with DOI records registered with DataCite. Instead of vague labels like "Text" or "Other," our classifier will assign specific, meaningful categories like "Dataset," "Journal Article," or "Software", when appropriate, to make research more discoverable and useful.

ArXiv Preprint Parsing and OpenAlex Parsing Assessment

We're developing improved parsing methods for arXiv preprints whilst simultaneously evaluating how well OpenAlex captures author and affiliation metadata through the well-established GROBID parsing tool.

ArXiv Preprint Matching

Here, we're developing automated tools to connect arXiv preprints with their corresponding published journal articles, creating a comprehensive map of how research moves from early sharing to formal publication.

Add or Improve Titles for Records with No Titles or Generic Titles

For this project, we're developing systematic approaches to identify and repair missing or generic titles in DataCite records, transforming unhelpful text like "Dataset" or blank title fields into meaningful, descriptive titles that enable proper discovery and citation.

Reconcile PKP Beacon Journals with OpenAlex Affiliation Metadata

We're evaluating how accurately OpenAlex captures affiliation metadata from journals hosted on the PKP platform, creating pathways to improve metadata quality for open access journals that serve diverse global research communities.