Class PageMetadataConnectorConfiguration
- Namespace
- Datafication.Connectors.WebConnector.Connectors
- Assembly
- Datafication.WebConnector.dll
Configuration for the page metadata connector.
public class PageMetadataConnectorConfiguration : WebConnectorConfigurationBase, IDataConnectorConfiguration
- Inheritance
- object
- WebConnectorConfigurationBase
- PageMetadataConnectorConfiguration
- Implements
- IDataConnectorConfiguration
Remarks
This connector extracts structured metadata from web pages, including standard meta tags, Open Graph data, Twitter Cards, and JSON-LD.
Properties
CustomMetaTags
Gets or sets custom meta tags to extract.
public Dictionary<string, string> CustomMetaTags { get; set; }
Property Value
- Dictionary<string, string>
Remarks
Each key is the output column name; each value is the meta name or property to extract. Example: { "Generator": "generator", "Viewport": "viewport" }
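The mapping from the remark above could be set up roughly as follows. This is a minimal sketch: it assumes the Datafication.WebConnector assembly is referenced and that the class has a public parameterless constructor, neither of which this page confirms. Settings inherited from WebConnectorConfigurationBase (for example, which page to read) are not documented in this section and are left out.

```csharp
using System.Collections.Generic;
using Datafication.Connectors.WebConnector.Connectors;

// Assumption: a public parameterless constructor is available.
// Inherited base-class settings are omitted because they are not documented here.
var config = new PageMetadataConnectorConfiguration
{
    // Key = output column name, value = meta name or property to extract.
    CustomMetaTags = new Dictionary<string, string>
    {
        ["Generator"] = "generator",
        ["Viewport"] = "viewport"
    }
};
```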
ExtractAllMetaTags
Gets or sets whether to extract all meta tags.
public bool ExtractAllMetaTags { get; set; }
Property Value
- bool
Remarks
When true, extracts every meta tag as its own column, using the tag's name or property as the column name. This is useful for exploration but may create many columns.
ExtractJsonLd
Gets or sets whether to extract JSON-LD structured data.
public bool ExtractJsonLd { get; set; }
Property Value
- bool
Remarks
When true (default), extracts JSON-LD script content as a JSON string. This includes Schema.org structured data.
ExtractLinkTags
Gets or sets whether to extract link tags.
public bool ExtractLinkTags { get; set; }
Property Value
- bool
Remarks
When true, extracts link tags such as canonical, alternate, and icon.
ExtractOpenGraph
Gets or sets whether to extract Open Graph metadata.
public bool ExtractOpenGraph { get; set; }
Property Value
- bool
Remarks
When true (default), extracts Open Graph protocol tags (og:title, og:image, etc.) used by Facebook and other social platforms.
ExtractStandardMeta
Gets or sets whether to extract standard meta tags.
public bool ExtractStandardMeta { get; set; }
Property Value
- bool
Remarks
When true (default), extracts standard meta tags such as description, keywords, and author.
ExtractTwitterCard
Gets or sets whether to extract Twitter Card metadata.
public bool ExtractTwitterCard { get; set; }
Property Value
- bool
Remarks
When true (default), extracts Twitter Card tags (twitter:card, twitter:title, etc.).
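Because each Extract* flag is an independent boolean, the flags can be combined to narrow the output. The sketch below keeps only the social-sharing metadata (Open Graph and Twitter Card) and turns off the other extractors. It assumes a public parameterless constructor and omits any inherited base-class settings, which this page does not describe.

```csharp
using Datafication.Connectors.WebConnector.Connectors;

// Assumption: a public parameterless constructor is available.
var socialOnly = new PageMetadataConnectorConfiguration
{
    ExtractOpenGraph = true,      // og:title, og:image, ...
    ExtractTwitterCard = true,    // twitter:card, twitter:title, ...
    ExtractStandardMeta = false,  // skip description, keywords, author, ...
    ExtractJsonLd = false,        // skip JSON-LD script content
    ExtractLinkTags = false,      // skip canonical, alternate, icon, ...
    ExtractAllMetaTags = false    // avoid one column per meta tag
};
```

Setting the flags explicitly, even where they match the documented defaults, keeps the intent visible to readers of the configuration.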
SingleRowResult
Gets or sets whether the result should be a single row with all metadata.
public bool SingleRowResult { get; set; }
Property Value
- bool
Remarks
When true (default), returns a single row with all metadata as columns. When false, returns multiple rows with name/value pairs.
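To get the multi-row name/value shape instead of one wide row, SingleRowResult can be switched off. This is a sketch under the same assumptions as elsewhere on this page (a public parameterless constructor, inherited settings omitted); the exact column names of the resulting rows are not documented in this section.

```csharp
using Datafication.Connectors.WebConnector.Connectors;

// Assumption: a public parameterless constructor is available.
var longFormat = new PageMetadataConnectorConfiguration
{
    // false: multiple rows of name/value pairs rather than a single row of columns.
    SingleRowResult = false
};
```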