@PipeBitInfo(name="Regex Sectionizer", description="Annotates Document Sections by detecting Section Headers using Regular Expressions provided in a Bar-Separated-Value (BSV) File.", products=SECTION) public class BsvRegexSectionizer extends RegexSectionizer
RegexSectionizer.SectionType
Modifier and Type | Field and Description |
---|---|
static String |
SECTION_TYPES_DESC |
static String |
SECTION_TYPES_PATH |
DIVIDER_LINE_NAME, LINE_DIVIDER_TAG, PARAM_TAG_DIVIDERS
Constructor and Description |
---|
BsvRegexSectionizer() |
Modifier and Type | Method and Description |
---|---|
static org.apache.uima.analysis_engine.AnalysisEngineDescription |
createEngineDescription(String sectionTypesPath) |
protected void |
loadSections()
Load Sections in a manner appropriate for the Regex Sectionizer
|
addSectionType, getSectionTypes, initialize, isBoolean, process, shouldParseSegment
getRequiredCasInterface, process
getCasInstancesRequired, hasNext, next
public static final String SECTION_TYPES_PATH
public static final String SECTION_TYPES_DESC
protected void loadSections() throws org.apache.uima.resource.ResourceInitializationException
loadSections
in class RegexSectionizer
org.apache.uima.resource.ResourceInitializationException
- -public static org.apache.uima.analysis_engine.AnalysisEngineDescription createEngineDescription(String sectionTypesPath) throws org.apache.uima.resource.ResourceInitializationException
org.apache.uima.resource.ResourceInitializationException
Copyright © 2012-2017 The Apache Software Foundation. All Rights Reserved.