2.6. Continuous text formats

Octopus: The perfect solution for processing continuous text formats such as EPUB and HTML

Octopus converts continuous text formats such as EPUB and HTML into structured, machine-readable data. Whether you are preparing content for eBooks, websites or scientific publications, Octopus recognises the elements of these widely used formats and carries them over into the target structure.

Why continuous text formats such as EPUB and HTML?
EPUB and HTML are the basis for digital content that is used on different platforms and devices. These formats often contain a variety of elements that Octopus can process seamlessly:

  • Formulae: Scientific and mathematical content is recognised and correctly interpreted.
  • Tables: Data in tabular form is extracted and provided in structured form.
  • Links and references: Hyperlinks and cross-references are retained and can be processed further.
  • Footnotes: Important additional information is recognised and integrated.
  • Headings: The hierarchical structure of the document is analysed and adopted.
  • Images: Graphics and visual content are extracted and integrated into the target structure.
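
To make "structured, machine-readable data" concrete, the following minimal sketch pulls headings, links, images and table rows out of an HTML fragment. It is not Octopus's implementation or API. It uses only Python's standard html.parser module, and the names StructureExtractor and extract_structure are hypothetical, chosen for illustration. Formulae, footnotes and nested tables are left out to keep the example short.

```python
from html.parser import HTMLParser

HEADING_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6"}


class StructureExtractor(HTMLParser):
    """Hypothetical helper: collects headings, links, images and table rows."""

    def __init__(self):
        super().__init__()
        self.headings = []  # (level, text) pairs in document order
        self.links = []     # href values of <a> elements
        self.images = []    # src values of <img> elements
        self.tables = []    # each table is a list of rows, each row a list of cells
        self._heading_level = None
        self._in_cell = False
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in HEADING_TAGS:
            self._heading_level = int(tag[1])
            self._buffer = []
        elif tag == "a" and attrs.get("href"):
            self.links.append(attrs["href"])
        elif tag == "img" and attrs.get("src"):
            self.images.append(attrs["src"])
        elif tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self.tables[-1].append([])
        elif tag in ("td", "th") and self.tables and self.tables[-1]:
            self._in_cell = True
            self._buffer = []

    def handle_data(self, data):
        # Only keep text that belongs to a heading or a table cell.
        if self._heading_level is not None or self._in_cell:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        # Collapse runs of whitespace in the collected text.
        text = " ".join("".join(self._buffer).split())
        if tag in HEADING_TAGS and self._heading_level is not None:
            self.headings.append((self._heading_level, text))
            self._heading_level = None
        elif tag in ("td", "th") and self._in_cell:
            self.tables[-1][-1].append(text)
            self._in_cell = False


def extract_structure(html):
    """Return the recognised elements of an HTML string as a plain dictionary."""
    parser = StructureExtractor()
    parser.feed(html)
    parser.close()
    return {
        "headings": parser.headings,
        "links": parser.links,
        "images": parser.images,
        "tables": parser.tables,
    }
```

Calling extract_structure on a page returns a dictionary that can be serialised to JSON or passed to a later processing step. A production converter would also need to handle malformed markup, nested tables and the EPUB container itself, none of which this sketch attempts.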

The advantages of Octopus
Octopus enables you to transform continuous text formats such as EPUB and HTML into structured data - quickly, reliably and without manual effort. The platform automatically recognises the various elements of these formats and prepares them for further processing. This is particularly useful for:

  • Cross-media publishing: Content can be prepared in a media-neutral way and published in various formats.
  • Archiving: Digital content is stored in standardised formats that can be used in the long term.
  • Automation: Recurring tasks such as the conversion of eBooks or website content are efficiently automated.

With Octopus, you save time, reduce errors and create a basis for integrating your content into digital workflows. Use the full range of what EPUB and HTML offer and turn that content into structured data.

Octopus - your solution for the future of digital content processing.