Identifying Call Center Topics with Natural Language Processing
Challenges
- A large financial company handled thousands of daily customer interactions with no systematic understanding of what customers were calling about.
- Decisions on product improvements and retention were made on intuition and manual spot-checks — leaving the business without a reliable foundation.
- The business needed a data-driven view of contact drivers and wanted to understand how call behaviour connected to the customers most likely to churn.
- Working with Danish-language call data added difficulty — most off-the-shelf NLP tooling is built and benchmarked on English, with limited support for Danish and insurance-specific terminology.
- Within the confines of GDPR, this also required strong anonymization techniques.
Approach
- Transcribed call recordings from audio to text to enable NLP analysis.
- Added anonymization algorithm to anonymize calls and filter out sensitive information.
- Integrated with customer and churn data.
- Developed custom tokenization to optimize richness and speed.
- Developed topic modelling (LDA) to identify topics that customers called about.
- Developed named-entity recognition (NER) to identify things and persons.
- Developed sentiment analysis to classify positive or negative topics.
Results
- End-to-end pipeline with transcription, anonymization, processing, NLP analysis and insights.
- Increased and systematic visibility into call center issues and performance.
Technologies & methods
Natural Language Processing (NLP) Sentiment Analysis Topic Modelling (LDA) Named-Entity Recognition (NER) Tokenization Speech-to-Text AI
Description
Finance - Customer Service
May 27, 2026