How to Clean Metadata from LibreOffice and OpenOffice ODF Files
That spreadsheet you sent to a client or the draft report you uploaded to a public repository still carries your name. It also holds the total time you spent editing it, the number of times you hit save, and potentially hidden comments you thought were private. If you are using LibreOffice or Apache OpenOffice, which both use the OpenDocument Format (ODF) standard for files like .odt, .ods, and .odp, this data is baked into the file structure by default.
Unlike Microsoft Office, which has a built-in 'Inspect Document' button that does most of the heavy lifting in one click, LibreOffice and OpenOffice require you to manually visit several different menus to strip out personal information. This makes it easy to miss a step. Forgetting to clear tracked changes or hidden sheets can lead to accidental leaks of internal notes, client names, or previous drafts. Here is how to thoroughly clean your documents before sharing them.
Understanding Where ODF Hides Your Data
To clean a file effectively, you need to know where the data lives. An OpenDocument file is not a single blob of code; it is a ZIP archive containing multiple XML files. The most important ones for privacy are meta.xml, which stores general properties like the author and creation date, and content.xml, which holds the actual text along with tracked changes and comments.
When you type your name into the user settings in LibreOffice, that name gets injected into these XML structures every time you save. Even if you delete the visible text, the underlying tags might remain. Understanding this helps explain why simply deleting text isn't enough-you have to actively tell the software to discard the associated metadata.
The Manual Cleaning Checklist for LibreOffice and OpenOffice
Since there is no single 'clean all' button, follow this specific sequence to ensure you don't leave traces behind. Start with the most obvious properties and move to the hidden structural elements.
- Clear User Data Settings: Go to Tools > Options > LibreOffice > User Data. Clear the fields for Name, Initials, and Company. You can leave them blank or use a generic placeholder. This prevents new documents from automatically grabbing your identity.
- Reset Document Statistics: Open your target file and go to File > Properties. On the General tab, click the Reset button. This wipes the total editing time and revision count. Uncheck the box labeled Apply user data so the current document doesn't pull your profile info again upon saving.
- Wipe Description Fields: Still in File > Properties, switch to the Description tab. Delete everything in Title, Subject, Keywords, and Comments. These fields often contain project codes or internal references.
- Remove Custom Properties: Check the Custom Properties tab. Templates or add-ins sometimes inject custom XML here. Delete any entries that look sensitive.
- Accept or Reject Tracked Changes: This is critical. Go to Edit > Changes > Accept or Reject. Review all changes and either accept them (to make them permanent) or reject them (to revert them). Do not just turn off tracking; the history remains in the file until you explicitly resolve it.
- Delete Comments: Comments carry author names and timestamps. Right-click each comment and select Delete. In Writer, enable View > Hidden Paragraphs to ensure no confidential notes are hiding in plain sight.
- Check for Hidden Sheets (Calc): If you are sending a spreadsheet, right-click on a sheet tab and choose Unhide to see if there are other sheets containing raw data or formulas you didn't intend to share. Delete them if necessary.
- Purge Old Versions: Go to File > Versions. If you have saved intermediate versions, they are stored inside the file. Delete them to prevent recipients from seeing earlier drafts.
After completing these steps, save the document. This manual process is tedious but effective for individual files. For organizations handling hundreds of reports, this workflow becomes a bottleneck and increases the risk of human error.
Automating the Process with Browser-Based Tools
If you find yourself cleaning dozens of files, or if you work on a Mac or Linux machine where installing specialized desktop scrubbers is cumbersome, browser-based tools offer a faster alternative. One such option is Vaulternal's document metadata remover.
This tool operates entirely within your browser using WebAssembly. That means the file never leaves your computer. You can verify this by opening your browser's network developer tools while the page loads-no data packets will be sent to a server. It supports ODF formats (.odt, .ods, .odp) alongside standard Office formats. You simply drag the file in, and the tool strips core properties, application properties, and custom XML data instantly. It also offers a view mode so you can inspect exactly what metadata exists before deciding to remove it.
For users concerned about compliance or audit trails, some advanced cleaners provide a JSON export of the removed fields, proving that the sanitization occurred without exposing the document content itself. This approach bridges the gap between the slow manual method and the complex command-line scripts used by developers.
Advanced: Direct XML Editing for Power Users
If you need absolute certainty that every byte of metadata is gone, you can edit the file directly. Since an ODF file is a ZIP archive, rename your report.odt to report.zip and extract it. Open meta.xml in a text editor. Look for tags like <dc:creator>, <meta:initial-creator>, and <meta:editing-duration>. Delete the contents of these tags or the tags themselves.
You should also check manifest.rdf if present, as it can hold RDF metadata links. Be careful here: modifying these files incorrectly can corrupt the document or invalidate digital signatures. Always keep a backup copy before attempting direct XML manipulation. After editing, re-zip the files and rename back to .odt. Re-open in LibreOffice to ensure it still functions correctly.
Preventing Future Leaks with Global Settings
Cleaning existing files is reactive; configuring your office suite is proactive. In LibreOffice, navigate to Tools > Options > Security > Options. Look for the checkbox labeled Remove personal information on saving. Enabling this setting tells LibreOffice to strip user names from changes and comments and reset editing statistics every time you save a file.
This setting significantly reduces the amount of residual data. However, it does not remove the content of comments themselves, nor does it hide hidden paragraphs. It serves as a safety net, not a complete solution. Combine this global setting with the manual checklist above for the best results.
Why This Matters for Compliance and Privacy
In regulated industries like law, healthcare, and government, leaking metadata can have serious consequences. Under regulations like GDPR, unintentionally releasing personally identifiable information (PII) through document properties can be classified as a data breach. Fines can reach up to 4% of global annual turnover for severe violations.
Even outside strict legal frameworks, professional reputation is at stake. Imagine sending a contract to a potential partner, only for them to find a hidden comment saying "Client X overpaid last time." Or a job applicant submitting a resume that still lists their current employer's name in the 'Company' field. These slips happen daily because users assume that what they see on screen is all that is in the file.
Does LibreOffice have a Document Inspector like Microsoft Office?
No, LibreOffice and Apache OpenOffice do not currently have a single-button 'Document Inspector' feature. You must manually clear properties, accept/reject changes, delete comments, and check for hidden content through various menus.
What happens when I click 'Reset' in File > Properties?
The Reset button clears statistical data like total editing time and revision count. It does not remove the author's name, comments, tracked changes, or custom properties. You still need to manually clear those fields.
Can I remove metadata from ODF files without installing software?
Yes. Browser-based tools like Vaulternal's Metadata Remover allow you to clean ODF files locally in your browser. The processing happens via JavaScript and WebAssembly, so the file never uploads to a server.
Where is the author name stored in an ODF file?
In an ODF package, the author name is primarily stored in the meta.xml file under tags like <dc:creator> and <meta:initial-creator>. It may also appear in comments and tracked changes within content.xml.
Does enabling 'Remove personal information on saving' delete my comments?
No. This setting strips the author's name from the comment metadata and resets editing stats, but it does not delete the comment text itself. You must manually delete comments to remove them.