Article and journal metadata

Metadata is data that provides information about other data. In scholarly publishing, metadata refers to structured information that describes the attributes of an article, including its title, authors, date of publication, copyright and licensing status, and more.

Metadata should be created following appropriate standards and it is commonly deposited via Crossref (article metadata), DataCite (metadata on other research objects such as data, software and more) or indexes (journal metadata). Journals have a responsibility to make their metadata easily available, so that any contents published are discoverable by readers via a broad range of search approaches.

Metadata standards

It is important to provide a common structure for metadata, so that it can be digitally read and automatically presented to users. Metadata standards promote interoperability, by helping ensure that records remain accurate and consistent.

Using metadata standards also enables and promotes development indexing and discovery services, particularly when in combination with persistent identifiers for articles (e.g. digital object identifiers, permalinks), authors (ORCID) and organisations (e.g. Research Organization Registry).

Some notable metadata standards include Dublin Core, Machine Readable Cataloging (MARC), Crossref and DataCite. Whilst JATS is primarily a format for storing the entire article (see Structured content), it is also used as a metadata interchange format between publishers and archivists

Typical differences between article and journal metadata are noted in the following table.

Focus of the metadata	Typical fields
Article	Title Author(s), including for example first name, middle name, last name, ORCID, CREDIT, institutional affiliation Date of Publication Volume/Issue details Page Numbers Abstract Keywords Digital Object Identifier Funding Metadata References
Journal	Journal title Journal abbreviation ISSN Article types/sections Copyright information Publisher

Making article metadata available to readers and discovery platforms

Journals typically have clear metadata displayed alongside individual articles. This helps readers identify the title of the article, its author(s), publication data and persistent identifier, for example. If the journal publishes JATS XML versions of the article, this can be used to supply metadata in a structured form, which can be helpful for text and data mining. Metadata can also be embedded in pdf documents, using the Extensible Metadata Platform. Most publishing systems (e.g. Open Journals System, Janeway) will make Dublin Core metadata available on the article’s abstract page, so that it can be read by referencing tools (e.g. Zotero), and will provide an Open Access Initiative Protocol for Metadata Harvesting (OAI-PMH) feed for metadata harvesting.

References