All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

Using Named-Entity Recognition to Filter Sensitive Information in Free Text for External APIs

By

thunderbong

9mo ago· 2 min readen

Summary

The article discusses methods for filtering sensitive information in free text before sending it to external APIs, particularly for chatbots and LLMs. It explains that while parameter-based filtering works for structured data, free text requires more sophisticated approaches. The piece highlights the limitations of regex-based filtering and introduces named-entity recognition (NER) as a more effective solution for identifying and redacting various types of sensitive information that cannot be captured with simple pattern matching.

Key quotes

· 4 pulled
Filtering the entire string may not be an option if an external API needs to process the value
You could use a regex to filter sensitive information, but that won't capture everything
Fortunately, named-entity recognition (NER) can be used to identify sensitive information
Not all sensitive information can be captured with a regex
Snippet from the RSS feed
Automatically filter sensitive information before sending it to external services or APIs, such as chatbots and LLMs.

You might also wanna read