DataAddresses in ScrapeMaps can be displayed in the Details Properties dialog. A DataAddress specifies the user defined name and description, the selector, and some write details for scraping data.
The DataAddress “ShoeDescription” is selected in the screenshot below. The user defined Name of a DataAddress is used as the column header when the scraped data is written to the file. A Name must be unique within a ScrapeMap and it cannot contain spaces or special characters.
The DataAddress’ user defined Description is used to store any additional text information about the DataAddress. It does not affect how the data is written or scraped.
Selector is a text field that is used to determine how a DataAddress’ data is selected. In the case of web content it is a CSS Selector derived from ScrapeMate’s algorithm. It changes depending on the kind of ScrapeMap and the kind of data that is selected. If the Selector is not capturing the data that you would prefer, you can manually edit it to target the correct data.
By default missing data is flagged as a Medium severity problem during the data scraping process. By enabling the Ignore Missing Data option, missing data will be flagged as a Notification Only problem.
Kind of Data To Capture allows the user to choose between capturing the text of a web element or the URL of a web link. This can be useful for recording URLs of search results. This option is set by default while you are creating a DataAddress using the ScrapeMap Wizard. However, you can change this option after a DataAddress’ creation in the Details Properties Dialog.
Is Key DataAddress impacts how the data is written to the specified destination. If the UpdateExistingTable write setting is selected, All Key DataAddresses will be used to match the scraped data to the data already written. If data has been written with values matching all Key DataAddress fields then the remaining fields will be overwritten with the latest data. If CreateNewTable is selected the Is Key DataAddress option will be ignored.
Labeled DataAddresses have the Is Labeled option enabled and use data scraped from the source to populate the column header when written. The label is derived from a CSS Selector. Label Overrides allow the user to manually override label text with a label of their own choosing. Specify the OriginalLabel and the override to enable this functionality. Labels must DataAddress naming conventions in that they cannot contain spaces or special characters. If a Label contains these a Label Override can be used to replace the invalid label. If an invalid Label is not overridden it will not be written. Labeled DataAddresses are created with their Is Labeled property enabled by the ScrapeMap Wizard. All Table DataAddresses are labeled series with overrides automatically populated.