Features

Features

Aspose.Words FOSS for Python is a document conversion and text extraction library for Word documents. The entire public API is built around the Document class.


Supported Input Formats

The Document class loads files from these formats automatically based on file extension:

FormatExtensions
Word Document.docx, .doc
Rich Text.rtf
Plain Text.txt
Markdown.md

Output Formats and Save Options

Call Document.save() with a SaveFormat constant for quick conversion, or pass a save-options object for fine-grained control.

OutputSaveFormat ConstantSave Options Class
PDFSaveFormat.PDFPdfSaveOptions — configure compliance level, JPEG quality
MarkdownSaveFormat.MARKDOWNMarkdownSaveOptions — control underline formatting export
Plain TextSaveFormat.TEXT

For code examples and save-options configuration details, see Core Management.


Text Extraction

Document.get_text() returns the full plain-text content of any loaded document without writing to disk.


Image Support

Documents with embedded images can be converted to all supported output formats. The conversion pipeline preserves image content through the export process.


API Summary

Class / MethodRole
DocumentLoad documents, convert formats, extract text
SaveFormatOutput format constants (PDF, MARKDOWN, TEXT)
PdfSaveOptionsPDF export configuration
MarkdownSaveOptionsMarkdown export configuration
 English