Machine learning detection of person names, locations, and organizations using a pre-trained NER model that runs entirely in your browser.
All machine learning processing happens 100% in your browser. Your data never leaves your device. The ML model is downloaded once and cached locally in your browser's IndexedDB storage.
No server. No cloud. No data collection.
Full names, titles, and name variations that pattern-based rules might miss. E.g., "Dr. Sarah Johnson", "Steve Bull"
Cities, countries, regions, and place names. E.g., "London", "Silicon Valley", "New York"
Company names, institutions, and organizations. E.g., "Microsoft", "NHS", "University of Cambridge"
ML Name Detection uses state-of-the-art Named Entity Recognition (NER) technology:
| Model | Download Size | Speed | Accuracy | Best For |
|---|---|---|---|---|
| DistilBERT NER | ~250 MB | Fast | Good | General use (recommended) |
| BERT Base NER | ~420 MB | Slower | Best | Maximum accuracy |
| BERT Base NER (uncased) | ~420 MB | Slower | Best | Case-insensitive text |
Open the sidebar and expand the Extras section. Check the ML Name Detection checkbox.
Choose from DistilBERT (faster, smaller) or BERT Base (more accurate, larger). DistilBERT is recommended for most use cases.
Click Download Model. This only happens once — the model is cached in your browser for future sessions.
Click Analyze. ML detection runs automatically alongside pattern-based rules, finding names that regex can't catch.
Check the Smart Suggestions panel for ML-detected entities. Click Scrub to anonymize everything.
ML detection is most valuable when:
No installation required. Open LogScrub, enable ML detection, and start analyzing.
Launch LogScrub