public interface JCasTermAnnotator
extends org.apache.uima.analysis_component.AnalysisComponent
RareWordDictionary
s
and adds them as entities to the Cas using a TermConsumer
Author: SPF
Affiliation: CHIP-NLP
Date: 2/24/14Modifier and Type | Field and Description |
---|---|
static String |
DEFAULT_DICT_DESC_PATH |
static String |
DEFAULT_EXCLUSION_TAGS |
static String |
DEFAULT_LOOKUP_WINDOW |
static int |
DEFAULT_MINIMUM_SPAN |
static String |
DICTIONARY_DESCRIPTOR_KEY
Deprecated.
replaced by
ConfigParameterConstants.PARAM_LOOKUP_XML |
static String |
PARAM_EXC_TAGS_KEY
optional part of speech tags for tokens that should not be used for lookup
|
static String |
PARAM_MIN_SPAN_KEY
optional minimum span for tokens that should not be used for lookup
|
static String |
PARAM_WINDOW_ANNOT_KEY
specifies the type of window to use for lookup
|
Modifier and Type | Method and Description |
---|---|
void |
findTerms(RareWordDictionary dictionary,
List<FastLookupToken> allTokens,
List<Integer> lookupTokenIndices,
CollectionMap<TextSpan,Long,? extends Collection<Long>> termsFromDictionary)
Given a dictionary, tokens, and lookup token indices, populate a terms collection with discovered terms
|
Collection<RareWordDictionary> |
getDictionaries() |
@Deprecated static final String DICTIONARY_DESCRIPTOR_KEY
ConfigParameterConstants.PARAM_LOOKUP_XML
static final String PARAM_WINDOW_ANNOT_KEY
static final String PARAM_EXC_TAGS_KEY
static final String PARAM_MIN_SPAN_KEY
static final String DEFAULT_LOOKUP_WINDOW
static final String DEFAULT_EXCLUSION_TAGS
static final int DEFAULT_MINIMUM_SPAN
static final String DEFAULT_DICT_DESC_PATH
Collection<RareWordDictionary> getDictionaries()
void findTerms(RareWordDictionary dictionary, List<FastLookupToken> allTokens, List<Integer> lookupTokenIndices, CollectionMap<TextSpan,Long,? extends Collection<Long>> termsFromDictionary)
dictionary
- -allTokens
- -lookupTokenIndices
- -termsFromDictionary
- -Copyright © 2012-2017 The Apache Software Foundation. All Rights Reserved.