Several types of named entities (specifically, organizations and companies) get tagged as belonging to _typeGroup : "socialTag" rather than _typeGroup : "entities". The structure of "socialTag" group presupposes linking its members to URLs rather than giving exact position in text:
_typeGroup : "socialTag" id : "http://d.opencalais.com/..." socialTag : "http://d.opencalais.com/..." forenduserdisplay : "true" name : "Goodwill Industries" importance : "1" originalValue : "Goodwill Industries"
This format of output (with no offsets specified) doesn't allow to map the extracted entity to the text.
Do you happen to know if there is a way to get offsets for such entities?