Remove PII from Microsoft Word and LibreOffice documents
LogScrub processes Word documents by extracting text, anonymizing PII, and repackaging the document with sanitized content. The original formatting, styles, and structure are preserved.
EMPLOYMENT AGREEMENT
This agreement is entered into between
ACME Corporation and John Michael Smith
(SSN: 123-45-6789).
Employee Contact Information:
Email: john.smith@personal-email.com
Phone: (555) 123-4567
Address: 742 Evergreen Terrace
Springfield, IL 62701
EMPLOYMENT AGREEMENT
This agreement is entered into between
ACME Corporation and [NAME-1]
(SSN: [SSN-1]).
Employee Contact Information:
Email: [EMAIL-1]
Phone: [PHONE-1]
Address: [ADDRESS-1]
[CITY-1], [STATE-1] [ZIP-1]
Meeting Notes - January 15, 2024 Attendees: Sarah Johnson, Mike Chen Action Items: • Sarah to email client at client@bigcorp.com by Friday • Mike to call vendor at 415-555-0199 • Review server logs from 192.168.1.50
Meeting Notes - January 15, 2024 Attendees: [NAME-1], [NAME-2] Action Items: • [NAME-1] to email client at [EMAIL-1] by Friday • [NAME-2] to call vendor at [PHONE-1] • Review server logs from [IP-1]
Word documents often contain hidden metadata with sensitive information:
LogScrub focuses on visible text content. For complete metadata removal, consider using your word processor's built-in "Inspect Document" feature alongside LogScrub.
LogScrub processes text within tables, maintaining cell structure:
| Name | Email | Phone | |-------------|-------------------|--------------| | Alice Brown | alice@corp.com | 555-111-2222 | | Bob Green | bob.g@company.net | 555-333-4444 |
| Name | Email | Phone | |----------|------------|------------| | [NAME-1] | [EMAIL-1] | [PHONE-1] | | [NAME-2] | [EMAIL-2] | [PHONE-2] |
See the original document rendered alongside the extracted/anonymized text before downloading.
Same values get the same replacement throughout the document. If "John Smith" appears in the header and body, both become [NAME-1].
Add your own regex patterns for company-specific identifiers, project codes, or internal reference numbers.
.docx — Microsoft Word 2007 and later.odt — OpenDocument Text (LibreOffice, OpenOffice)Note: Legacy .doc format is not supported. Please save as .docx first.
Drop your DOCX or ODT file into LogScrub to get started.
Launch LogScrub