Basic website scraping. Works with most websites, but might fail with javascript heavy ones. This connector takes a given URL for a website and extracts the available text from it. Once extracted, the contents are outputted as an HTML.

Properties

Required properties:

  • url: URL for website

Available metadata

  • url: URL for website

Compatible loaders:

  • AutoLoader
  • HTMLLoader

Usage