HeadingProcessor
- class burdoc.processors.heading_processor.HeadingProcessor(log_level: int = 20)
The HeadingProcessor takes the correctly ordered layout elements and applies additional semantic processing to identify headings and titles. It also generates a hierarchy of headings for the page.
Requires: ['elements']
Optional: None
Generators: ['elements', 'page_hierarchy']
- add_generated_items_to_fig(page_number: int, fig: Figure, data: Dict[str, Any])
Draw any items generated by this processor onto a page image.
- generates() → List[str]
Return the list of fields added by this processor.
- requirements() → Tuple[List[str], List[str]]
Return the list of required data fields and the list of optional data fields.
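
The relationship between requirements() and generates() can be illustrated with a minimal sketch. It does not import burdoc: SketchHeadingProcessor is a hypothetical stand-in that only mirrors the two documented signatures and the field names listed above, and missing_requirements is an assumed helper, not part of the library. The shape of the data dictionary is likewise an assumption.

```python
from typing import Any, Dict, List, Tuple


class SketchHeadingProcessor:
    """Hypothetical stand-in mirroring only the two documented methods."""

    def requirements(self) -> Tuple[List[str], List[str]]:
        # (required fields, optional fields), matching "Requires" and "Optional"
        return (["elements"], [])

    def generates(self) -> List[str]:
        # Fields the processor adds or rewrites, matching "Generators"
        return ["elements", "page_hierarchy"]


def missing_requirements(processor: Any, data: Dict[str, Any]) -> List[str]:
    """Return required fields absent from `data` (assumed helper, not in burdoc)."""
    required, _optional = processor.requirements()
    return [field for field in required if field not in data]


processor = SketchHeadingProcessor()
page_data: Dict[str, Any] = {"elements": {}}  # placeholder payload; real shape is unknown
missing = missing_requirements(processor, page_data)
```

A check like this could run before a processor is invoked, so that a missing 'elements' field is reported up front rather than surfacing as an error partway through processing.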