HeadingProcessor

class burdoc.processors.heading_processor.HeadingProcessor(log_level: int = 20)

The HeadingProcessor takes the correctly ordered layout elements and applies additional semantic processing to identify headings and titles. It also generates a hierarchy of headings for the page.

Requires: [‘elements’]

Optional: None

Generators: [‘elements’, ‘page_hierarchy’]

add_generated_items_to_fig(page_number: int, fig: Figure, data: Dict[str, Any])

Draw any items generated by this processor to a page image

generates() List[str]

Return list of fields added by this processor

requirements() Tuple[List[str], List[str]]

Return list of required data fields and list of optional data fields