Developer Guide
Aspose.Note FOSS for Python is a free, open-source library for reading Microsoft OneNote .one section files without any dependency on Microsoft Office. It provides a clean public API under the aspose.note package, modeled after the Aspose.Note for .NET interface. The library is suitable for document automation, content indexing, data extraction pipelines, and archival workflows.
This developer guide covers the full public API surface available in version 26.3.1, with runnable code examples for every major feature.
Document Loading
Load a .one file from a file path or a binary stream. The Document class is the entry point for all operations.
Load from a file path
Load a OneNote section file from disk by passing the file path string directly to the Document constructor:
from aspose.note import Document
doc = Document("MyNotes.one")Load from a binary stream
Useful when reading from cloud storage, HTTP responses, or in-memory buffers:
from pathlib import Path
from aspose.note import Document
with Path("MyNotes.one").open("rb") as f:
doc = Document(f)Load options
Use LoadOptions to set optional parameters such as history loading before passing the options object to the Document constructor:
from aspose.note import Document, LoadOptions
opts = LoadOptions()
opts.LoadHistory = True # Include page history in the DOM
doc = Document("MyNotes.one", opts)Note:
DocumentPasswordexists onLoadOptionsfor API compatibility, but encrypted documents are not supported. Attempting to load an encrypted file raisesIncorrectPasswordException.
Document Structure (DOM)
The OneNote document model is a tree rooted at Document. Every node exposes parent and child accessors for traversal:
Document
└── Page (0..n)
├── Title
│ ├── TitleText (RichText)
│ ├── TitleDate (RichText)
│ └── TitleTime (RichText)
└── Outline (0..n)
└── OutlineElement (0..n)
├── RichText
├── Image
├── Table
│ └── TableRow
│ └── TableCell
│ └── RichText / Image
└── AttachedFileEvery node exposes ParentNode and a Document property that walks up to the root. Composite nodes support child iteration, FirstChild, LastChild, AppendChildLast, InsertChild, RemoveChild, and GetChildNodes(Type).
Iterating Pages
Pages are the direct children of Document. Iterate them directly or use GetChildNodes:
from aspose.note import Document, Page
doc = Document("MyNotes.one")
for page in doc:
title = page.Title.TitleText.Text if page.Title and page.Title.TitleText else "(untitled)"
author = page.Author or "(unknown)"
print(f" {title} [by {author}]")Page metadata:
| Property | Type | Description |
|---|---|---|
Title | Title | None | Page title block |
Author | str | None | Author string |
CreationTime | datetime | None | Page creation time |
LastModifiedTime | datetime | None | Last modification time |
Level | int | None | Sub-page indent level |
Text Extraction
Extract all plain text
Call GetChildNodes(RichText) to collect every RichText node in a single recursive traversal, then join the .Text strings:
from aspose.note import Document, RichText
doc = Document("MyNotes.one")
all_text = [rt.Text for rt in doc.GetChildNodes(RichText) if rt.Text]
print("\n".join(all_text))Inspect formatting runs
Each RichText contains a list of TextRun segments. Each run carries its own TextStyle:
from aspose.note import Document, RichText
doc = Document("FormattedNotes.one")
for rt in doc.GetChildNodes(RichText):
for run in rt.TextRuns:
style = run.Style
flags = []
if style.IsBold: flags.append("bold")
if style.IsItalic: flags.append("italic")
if style.IsHyperlink: flags.append(f"link={style.HyperlinkAddress}")
print(f"{run.Text!r:40s} [{', '.join(flags)}]")Extract hyperlinks
Check run.Style.IsHyperlink on each TextRun to identify hyperlink runs and read Style.HyperlinkAddress for the target URL:
from aspose.note import Document, RichText
doc = Document("MyNotes.one")
for rt in doc.GetChildNodes(RichText):
for run in rt.TextRuns:
if run.Style.IsHyperlink and run.Style.HyperlinkAddress:
print(run.Text, "->", run.Style.HyperlinkAddress)Image Extraction
Use GetChildNodes(Image) to collect all embedded images from the document and write the raw bytes of each image to disk:
from aspose.note import Document, Image
doc = Document("MyNotes.one")
for i, img in enumerate(doc.GetChildNodes(Image), start=1):
name = img.FileName or f"image_{i}.bin"
with open(name, "wb") as f:
f.write(img.Bytes)
print(f"Saved {name} ({img.Width}x{img.Height})")Image properties: FileName, Bytes, Width, Height, AlternativeTextTitle, AlternativeTextDescription, HyperlinkUrl, Tags.
Table Parsing
Use GetChildNodes(Table) to iterate all tables in the document, then traverse TableRow and TableCell nodes to extract cell text:
from aspose.note import Document, Table, TableRow, TableCell, RichText
doc = Document("MyNotes.one")
for table in doc.GetChildNodes(Table):
print("Column widths:", [col.Width for col in table.Columns])
for r, row in enumerate(table.GetChildNodes(TableRow), start=1):
cells = row.GetChildNodes(TableCell)
row_text = [
" ".join(rt.Text for rt in cell.GetChildNodes(RichText)).strip()
for cell in cells
]
print(f"Row {r}:", row_text)Attached Files
Use GetChildNodes(AttachedFile) to retrieve every embedded file attachment and write its bytes to disk under the original filename:
from aspose.note import Document, AttachedFile
doc = Document("NotesWithAttachments.one")
for i, af in enumerate(doc.GetChildNodes(AttachedFile), start=1):
name = af.FileName or f"attachment_{i}.bin"
with open(name, "wb") as f:
f.write(af.Bytes)
print(f"Saved: {name}")Tags and Numbered Lists
Inspect NoteTag items
Iterate the .Tags list on each RichText or Image node to read the Label and Icon properties for all OneNote tags applied in the source document:
from aspose.note import Document, RichText, Image, Table
doc = Document("TaggedNotes.one")
for rt in doc.GetChildNodes(RichText):
for tag in rt.Tags:
print(f"RichText tag: {tag.Label} icon={tag.Icon}")
for img in doc.GetChildNodes(Image):
for tag in img.Tags:
print(f"Image tag: {tag.Label}")Inspect numbered lists
Check the NumberList property on each OutlineElement node to read the format of any numbered list associated with that outline element:
from aspose.note import Document, OutlineElement
doc = Document("NumberedNotes.one")
for oe in doc.GetChildNodes(OutlineElement):
nl = oe.NumberList
if nl:
print(f"format={nl.Format!r}")DocumentVisitor Pattern
Use DocumentVisitor to implement a visitor that walks the entire document tree:
from aspose.note import Document, DocumentVisitor, Page, RichText, Image
class ContentCounter(DocumentVisitor):
def __init__(self):
self.pages = 0
self.texts = 0
self.images = 0
def VisitPageStart(self, page: Page) -> None:
self.pages += 1
def VisitRichTextStart(self, rt: RichText) -> None:
self.texts += 1
def VisitImageStart(self, img: Image) -> None:
self.images += 1
doc = Document("MyNotes.one")
counter = ContentCounter()
doc.Accept(counter)
print(f"Pages: {counter.pages}, Texts: {counter.texts}, Images: {counter.images}")PDF Export
PDF export requires the optional ReportLab dependency. Install it with:
pip install "aspose-note[pdf]"Basic PDF export
Pass SaveFormat.Pdf to Document.Save() with a file path to write the entire document as a PDF file:
from aspose.note import Document, SaveFormat
doc = Document("MyNotes.one")
doc.Save("output.pdf", SaveFormat.Pdf)PDF export with options
Create a PdfSaveOptions instance and pass it to Document.Save() to control export behavior, or pass a BytesIO buffer instead of a path to capture the bytes in memory:
import io
from aspose.note import Document, SaveFormat
from aspose.note.saving import PdfSaveOptions
doc = Document("MyNotes.one")
##With save options
opts = PdfSaveOptions()
doc.Save("output.pdf", opts)
##Save to in-memory stream
buf = io.BytesIO()
doc.Save(buf, PdfSaveOptions())
pdf_bytes = buf.getvalue()Note:
PdfSaveOptions.PageIndexandPageCountfields exist for API compatibility. In v26.3.1, the save operation exports the entire document regardless of these values.
Current Limitations
| Area | Status |
|---|---|
Reading .one files | Supported |
| PDF export (via ReportLab) | Supported |
Writing back to .one | Not implemented |
| Encrypted documents | Not supported (raises IncorrectPasswordException) |
| HTML / image / ONE save formats | Declared for API compatibility; raise UnsupportedSaveFormatException |
Available Guides
- Features Overview: full feature list with evidence
- Getting Started: prerequisites, installation, and first steps
- Installation: pip install and optional dependencies