public final class BsvRareWordDictionary extends java.lang.Object implements RareWordDictionary
A MemRareWordDictionary with a file reader.
Author: SPF
Affiliation: CHIP-NLP
Date: 1/9/14

Modifier and Type | Field and Description |
---|---|
private RareWordDictionary | _delegateDictionary |
private static java.lang.String | BSV_FILE_PATH |
private static Logger | LOGGER |

Constructor and Description |
---|
BsvRareWordDictionary(java.lang.String name, java.io.File bsvFile) |
BsvRareWordDictionary(java.lang.String name, java.lang.String bsvFilePath) |
BsvRareWordDictionary(java.lang.String name, UimaContext uimaContext, java.util.Properties properties) |

Modifier and Type | Method and Description |
---|---|
private static RareWordTermMapCreator.CuiTerm | createCuiTuiTerm(java.lang.String... columns) |
java.lang.String | getName(): The Type identifier and Name are used to maintain a collection of dictionaries, so the combination of Type and Name should be unique for each dictionary if possible. |
java.util.Collection<RareWordTerm> | getRareWordHits(FastLookupToken fastLookupToken): Any single token can exist in zero or more terms in the dictionary. |
java.util.Collection<RareWordTerm> | getRareWordHits(java.lang.String rareWordText): Get all terms within the dictionary that contain a given rare word. |
private static java.util.Collection<RareWordTermMapCreator.CuiTerm> | parseBsvFile(java.io.File bsvFile): Create a collection of RareWordTermMapCreator.CuiTerm objects by parsing a BSV file. |

private static final Logger LOGGER
private static final java.lang.String BSV_FILE_PATH
private RareWordDictionary _delegateDictionary
public BsvRareWordDictionary(java.lang.String name, UimaContext uimaContext, java.util.Properties properties)
public BsvRareWordDictionary(java.lang.String name, java.lang.String bsvFilePath)
public BsvRareWordDictionary(java.lang.String name, java.io.File bsvFile)
public java.lang.String getName()
Specified by: getName in interface RareWordDictionary
public java.util.Collection<RareWordTerm> getRareWordHits(FastLookupToken fastLookupToken)
Specified by: getRareWordHits in interface RareWordDictionary
Parameters: fastLookupToken - a single-word token
public java.util.Collection<RareWordTerm> getRareWordHits(java.lang.String rareWordText)
Specified by: getRareWordHits in interface RareWordDictionary
Parameters: rareWordText - text of the rare word
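
The private _delegateDictionary field and the reference to MemRareWordDictionary suggest that lookups are forwarded to an in-memory dictionary built from the file. The following is a minimal sketch of that delegation pattern, not the library's implementation. SimpleRareWordLookup, InMemoryLookup, and DelegatingLookup are hypothetical names, and plain strings stand in for RareWordTerm.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical lookup contract, loosely modeled on getName() and
// getRareWordHits(String) from the method summary; not the library interface.
interface SimpleRareWordLookup {
    String getName();

    Collection<String> getRareWordHits(String rareWordText);
}

// In-memory index from a rare word to every term text indexed under it.
final class InMemoryLookup implements SimpleRareWordLookup {
    private final String name;
    private final Map<String, List<String>> termsByRareWord = new HashMap<>();

    InMemoryLookup(String name) {
        this.name = name;
    }

    void addTerm(String rareWord, String termText) {
        termsByRareWord.computeIfAbsent(rareWord, key -> new ArrayList<>()).add(termText);
    }

    @Override
    public String getName() {
        return name;
    }

    // A rare word can belong to zero or more terms, so a miss yields an empty collection.
    @Override
    public Collection<String> getRareWordHits(String rareWordText) {
        List<String> hits = termsByRareWord.get(rareWordText);
        if (hits == null) {
            return Collections.<String>emptyList();
        }
        return Collections.unmodifiableList(hits);
    }
}

// Wrapper that builds the in-memory lookup once and forwards every query to it.
final class DelegatingLookup implements SimpleRareWordLookup {
    private final SimpleRareWordLookup delegate;

    // rareWordByTerm maps each term text to its rare word (assumed already chosen).
    DelegatingLookup(String name, Map<String, String> rareWordByTerm) {
        InMemoryLookup memory = new InMemoryLookup(name);
        for (Map.Entry<String, String> entry : rareWordByTerm.entrySet()) {
            memory.addTerm(entry.getValue(), entry.getKey());
        }
        this.delegate = memory;
    }

    @Override
    public String getName() {
        return delegate.getName();
    }

    @Override
    public Collection<String> getRareWordHits(String rareWordText) {
        return delegate.getRareWordHits(rareWordText);
    }
}
```

In this sketch the wrapper holds no lookup state of its own: the constructor does the loading and every later call goes straight to the delegate.
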
private static java.util.Collection<RareWordTermMapCreator.CuiTerm> parseBsvFile(java.io.File bsvFile)
Create a collection of RareWordTermMapCreator.CuiTerm objects by parsing a BSV file. The file can be in one of three columnar formats:
CUI|Text
or
CUI|TUI|Text
or
CUI|TUI|Text|PreferredTerm
If the TUI column is omitted, then the entityId for the dictionary is used as the TUI.
Parameters: bsvFile - file containing term rows and BSV columns
private static RareWordTermMapCreator.CuiTerm createCuiTuiTerm(java.lang.String... columns)
Parameters: columns - two or three columns representing CUI, Text or CUI, TUI, Text, respectively
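
The bar-separated row formats described for parseBsvFile can be illustrated with a small standalone parser. This is only a sketch under stated assumptions, not the code of parseBsvFile or createCuiTuiTerm: CuiTuiText is a hypothetical stand-in for RareWordTermMapCreator.CuiTerm, BsvRowSketch and its methods are invented for illustration, the defaultTui parameter plays the role of the dictionary's entityId, and silently skipping malformed rows is a guess at reasonable behavior.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for RareWordTermMapCreator.CuiTerm; not the library class.
final class CuiTuiText {
    final String cui;
    final String tui;
    final String text;
    final String preferredTerm; // null when the row has no fourth column

    CuiTuiText(String cui, String tui, String text, String preferredTerm) {
        this.cui = cui;
        this.tui = tui;
        this.text = text;
        this.preferredTerm = preferredTerm;
    }
}

final class BsvRowSketch {
    private BsvRowSketch() {
    }

    // Parses one bar-separated row. Returns null for a blank line or an
    // unsupported column count (an assumption; the real class may log or throw).
    static CuiTuiText parseRow(String line, String defaultTui) {
        if (line == null || line.trim().isEmpty()) {
            return null;
        }
        // The -1 limit keeps trailing empty columns, so the column count is exact.
        String[] columns = line.split("\\|", -1);
        for (int i = 0; i < columns.length; i++) {
            columns[i] = columns[i].trim();
        }
        switch (columns.length) {
            case 2: // CUI|Text: fall back to the supplied default TUI
                return new CuiTuiText(columns[0], defaultTui, columns[1], null);
            case 3: // CUI|TUI|Text
                return new CuiTuiText(columns[0], columns[1], columns[2], null);
            case 4: // CUI|TUI|Text|PreferredTerm
                return new CuiTuiText(columns[0], columns[1], columns[2], columns[3]);
            default:
                return null;
        }
    }

    // Parses every row, skipping the ones that parseRow rejects.
    static List<CuiTuiText> parseRows(Iterable<String> lines, String defaultTui) {
        List<CuiTuiText> terms = new ArrayList<>();
        for (String line : lines) {
            CuiTuiText term = parseRow(line, defaultTui);
            if (term != null) {
                terms.add(term);
            }
        }
        return terms;
    }
}
```

Dispatching on the column count keeps the three layouts in one place, and reading lines from an Iterable rather than a java.io.File keeps the sketch independent of the file system.
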