2. Input formats - The diversity of Octopus

Octopus offers an impressive range of supported input formats, allowing you to process almost any type of document. Whether it's text, tables, images, videos or even programming code - Octopus is equipped for all requirements. Here is an overview of the most important input formats:

  1. Audio formats
    MP3, WAV: Ideal for processing voice recordings or podcasts.
  2. Video formats
    MP4, AVI: Supports the analysis and processing of video files.
  3. Vector-based formats
    PDF, PostScript (PS): Perfect for high-quality further processing.
  4. Continuous text formats
    HTML, EPUB, DOCX: For websites, e-books and text documents.
  5. Table formats
    Excel, CSV: For processing data and tables.
  6. Raster image formats
    JPG, BMP: For analysing and OCR processing of images.
  7. Vector formats
    SVG, AI: For scalable graphics and design files.
  8. Programming code
    XSLT, CSS: Supports the processing and customisation of code.
  9. XML formats
    DITA, JATS, DocBook: For structured documents and publications.
  10. Metadata formats
    LOM, DC, Marc21, ONIX: For managing and enriching metadata.
  11. Presentation formats
    PPTX: For processing presentations.

With Octopus, you can not only process these formats, but also transform, enrich and optimise them into other formats. The platform offers you maximum flexibility and efficiency, regardless of the format in which your content is available.

Octopus - your solution for limitless document processing!