Exclude certain html tags from calais processing

Hi,

I am using the tagging API at "https://api.thomsonreuters.com/permid/calais" and pass HTML content to it for processing. I want to exclude certain parts of the HTML from being processed: for example, tags with the class "robots-nocontent", hidden elements, or elements marked in some other way on the tag itself.

How can I achieve this?

Tags: intelligent-tagging-api, intelligent-tagging, open-calais-api, semantic-metadata-tagging
1 Answer

To submit HTML content for tagging, send it with the following header: Content-Type: text/html

If there are sections that you do not want tagged, remove them from the HTML before submitting the API request.
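
As a rough illustration of that pre-processing step, here is a minimal sketch using only the Python standard library. It drops any element that has the class "robots-nocontent" or a `hidden` attribute, then builds (but does not send) a request with the `Content-Type: text/html` header. The names `ExcludingFilter`, `strip_excluded`, and `build_request` are hypothetical, the exclusion rules are just the ones mentioned in the question, and the sketch assumes reasonably well-formed HTML with matching closing tags. Authentication headers are not covered in this thread, so they are left to the caller.

```python
from html.parser import HTMLParser
from urllib.request import Request

CALAIS_URL = "https://api.thomsonreuters.com/permid/calais"  # URL from the question

# Elements that never have a closing tag, so they must not change nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class ExcludingFilter(HTMLParser):
    """Re-emits HTML, skipping excluded elements and everything inside them.

    Comments and doctype declarations are dropped as a side effect.
    """

    def __init__(self, excluded_classes=("robots-nocontent",)):
        super().__init__(convert_charrefs=False)
        self.excluded_classes = set(excluded_classes)
        self.out = []
        self.skip_depth = 0  # > 0 while inside an excluded element

    def _is_excluded(self, attrs):
        attr_map = dict(attrs)
        classes = (attr_map.get("class") or "").split()
        return "hidden" in attr_map or bool(self.excluded_classes.intersection(classes))

    def handle_starttag(self, tag, attrs):
        if self.skip_depth:
            if tag not in VOID_TAGS:
                self.skip_depth += 1
        elif self._is_excluded(attrs):
            if tag not in VOID_TAGS:
                self.skip_depth = 1
        else:
            self.out.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> never change the nesting depth.
        if not self.skip_depth and not self._is_excluded(attrs):
            self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if self.skip_depth:
            if tag not in VOID_TAGS:
                self.skip_depth -= 1
        else:
            self.out.append(f"</{tag}>")

    def handle_data(self, data):
        if not self.skip_depth:
            self.out.append(data)

    def handle_entityref(self, name):
        if not self.skip_depth:
            self.out.append(f"&{name};")

    def handle_charref(self, name):
        if not self.skip_depth:
            self.out.append(f"&#{name};")


def strip_excluded(html, excluded_classes=("robots-nocontent",)):
    """Return html with excluded elements removed."""
    parser = ExcludingFilter(excluded_classes)
    parser.feed(html)
    parser.close()
    return "".join(parser.out)


def build_request(html, extra_headers=None):
    """Build a POST request for the cleaned HTML; the caller supplies auth headers."""
    headers = {"Content-Type": "text/html"}
    headers.update(extra_headers or {})
    body = strip_excluded(html).encode("utf-8")
    return Request(CALAIS_URL, data=body, headers=headers, method="POST")
```

Calling `strip_excluded(page_html)` returns the HTML without the excluded sections, and `build_request(page_html, extra_headers=...)` wraps that result in a request object that can be passed to `urllib.request.urlopen` once the required credentials are added. For messy real-world markup, a dedicated HTML library may be more forgiving than this depth-counting approach.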
