Tutorial - Define Extract Rules

How to define extract rules?






Think of the result you want to extract from a website as an Excel table. Define Columns means defining the extract rules for a detail page, and the data extracted from each detail page fills one row.

For example, suppose a website sells food. This step requires us to navigate to a food detail page.

We built a sandbox where you can test this example; find it here.
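To make the "rules become columns, a detail page becomes one row" idea concrete, here is a minimal Python sketch. It is not how AnyPicker works internally; the markup, the class names, and the `RULES` mapping are all invented for illustration, and the matching is deliberately simplistic (flat elements only).

```python
from html.parser import HTMLParser

# Hypothetical detail-page markup; class names and values are made up for this sketch.
DETAIL_PAGE = """
<div class="product">
  <img class="photo" src="/img/bread.png">
  <h1 class="title">Bread</h1>
  <span class="price">$2.50</span>
</div>
"""

# One rule per column: column name -> (class to match, attribute to read, or None for text).
RULES = {
    "image_url": ("photo", "src"),
    "title": ("title", None),
    "price": ("price", None),
}


class RuleExtractor(HTMLParser):
    """Applies every rule to one page and collects a single row."""

    def __init__(self, rules):
        super().__init__()
        self.rules = rules
        self.row = {column: "" for column in rules}
        self._open_column = None  # column whose element text is currently being read

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        classes = (attr_map.get("class") or "").split()
        for column, (css_class, attribute) in self.rules.items():
            if css_class in classes:
                if attribute is not None:
                    self.row[column] = attr_map.get(attribute) or ""
                else:
                    self._open_column = column

    def handle_data(self, data):
        if self._open_column is not None:
            self.row[self._open_column] += data.strip()

    def handle_endtag(self, tag):
        # Any closing tag ends the current text capture (fine for flat elements).
        self._open_column = None


def extract_row(html, rules):
    parser = RuleExtractor(rules)
    parser.feed(html)
    return parser.row


row = extract_row(DETAIL_PAGE, RULES)
print(row)
```

Each key in `RULES` plays the role of one column, and running the rules against another detail page with the same layout would produce another row for the same table.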

1. Select what you want on the detail page.

As shown in the image below, we want to extract the image URL, title, price, review count, and description of the product “Bread”.

detail-page-extraction-requirement

Just move the mouse over the target detail page and click each of the items mentioned above.

detail-page-select-items-gif

Remember that each item you select generates a rule that defines one column in the resulting Excel table.

2. Remove the unnecessary items.

One feature of AnyPicker’s HTML element extraction is that it separates each item inside a large container. As a result, this step may pick up more items than you need. Just remove the extras in the Setting Panel.

detail-page-remove-unnecessary-items
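Conceptually, this cleanup step is a filter over the candidate fields. The sketch below only illustrates that idea; the default names such as `field_1` and all sample values are assumptions, not what AnyPicker actually generates.

```python
# Hypothetical result of clicking one large container: one candidate field per child item.
candidate_fields = {
    "field_1": "Bread",
    "field_2": "$2.50",
    "field_3": "Add to cart",  # button label, not wanted in the table
    "field_4": "12 reviews",
    "field_5": "Share",        # widget text, not wanted in the table
}

# The fields we would delete in the Setting Panel.
unwanted = {"field_3", "field_5"}

# Keep every field that was not marked for removal, preserving the original order.
kept_fields = {
    name: value for name, value in candidate_fields.items() if name not in unwanted
}
print(kept_fields)
```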

3. Set the column names.

Change each field name (column name) to something more meaningful. Then check the DATA PREVIEW block at the bottom, which shows one row of data.

detail-page-data-preview
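Renaming fields and previewing a single row can be pictured as follows. This is a rough sketch with made-up default names (`field_1` and so on) and sample values; it mimics the idea of the preview using Python's standard `csv` module rather than reproducing AnyPicker's output format.

```python
import csv
import io

# Hypothetical extracted row with auto-generated field names.
row = {"field_1": "Bread", "field_2": "$2.50", "field_4": "12 reviews"}

# The more meaningful column names we would type into the tool.
column_names = {"field_1": "title", "field_2": "price", "field_4": "review_count"}

# Apply the renaming; values stay untouched.
renamed_row = {column_names[old]: value for old, value in row.items()}

# Render a header plus one data row, similar in spirit to a one-row preview.
buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=list(renamed_row))
writer.writeheader()
writer.writerow(renamed_row)
preview = buffer.getvalue()
print(preview)
```

Clear column names pay off later, because the header line is what you will see at the top of the exported spreadsheet.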

Now go to the next step!