This library aims to be a comprehensive tool for extracting all metadata embedded in HTML. It currently supports Schema.org microdata through a third-party library, and it has native implementations for BEPress, Dublin Core, Highwire Press, JSON-LD, Open Graph, Twitter, EPrints, PRISM, and COinS. It also extracts general metadata that doesn't belong to a particular standard (for instance, the content of the title tag or of meta description tags). Support is planned for RDFa, AGLS, and other metadata types. Contributions and requests for other metadata types are welcome! You can also pass an options object as the first argument to supply extra parameters, which helps with websites that require the user-agent or cookies to be set before they return a response.
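As a rough illustration of the options object mentioned above, the sketch below builds one with a user-agent and an optional cookie and hands it to the scraper. The helper names `buildOptions` and `fetchMetadata` are hypothetical, and the `url` and `headers` field names are assumptions modeled on common request-style options; check the project's own documentation for the exact shape it accepts.

```javascript
// Hypothetical helper: builds an options object carrying a user-agent
// and, optionally, a cookie. The `url` and `headers` field names are
// assumptions, not confirmed by this page.
function buildOptions(url, userAgent, cookie) {
  const headers = { 'User-Agent': userAgent };
  if (cookie) {
    headers.Cookie = cookie;
  }
  return { url: url, headers: headers };
}

// Hypothetical wrapper: `scrape` is expected to be the function exported
// by html-metadata. It is passed in as a parameter so this sketch stays
// self-contained and can be tried with a stub.
function fetchMetadata(scrape, url, userAgent, cookie) {
  return scrape(buildOptions(url, userAgent, cookie));
}

// Possible usage (requires `npm install html-metadata` and network access):
// const scrape = require('html-metadata');
// fetchMetadata(scrape, 'https://example.com/', 'my-metadata-bot/1.0')
//   .then((metadata) => console.log(metadata))
//   .catch((error) => console.error(error));
```

Passing the scraper in as an argument is only a convenience for experimenting; in ordinary use you would call the library's exported function directly with the options object.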

Features

  • HTML metadata scraper and parser for Node.js
  • Supports both Promise-based and callback-style usage
  • Supports Schema.org microdata

Categories

Web Scrapers

License

MIT License

Additional Project Details

Programming Language

JavaScript

Related Categories

JavaScript Web Scrapers

Registered

2023-04-12