On-Demand Workshop

Anonymize Financial Data with a Fine-Tuned SLM 

Explore how to develop an automated, customizable system for anonymizing financial data using machine learning.

Recorded on April 1, 2024

In this workshop, Gretel and Google showcase an end-to-end machine learning approach to automatically anonymize sensitive customer data in financial documents. 

We use synthetic customer data to teach a small language model how to anonymize sensitive fields like names, account numbers, transactions details, etc. This avoids manual data collection and labeling, and can be easily adapted and customized to support customers’ specific data patterns. The end result is an automated, customizable system for anonymizing financial data that improves over time.

Key topics covered include:

  • Generating synthetic training data
  • Fine-tuning language models
  • Deploying through a Streamlit application
  • Enabling human-in-the-loop validation

Register to watch this on-demand workshop

Presented by


Dr. Ali Arsanjani, Ph.D.

Director AI/ML Engineering, Google


Alex Watson

Co-Founder and Chief Product Officer, Gretel


Maarten Van Segbroeck, Ph.D.

Principal Applied Science, Gretel

Discord Join us in the Synthetic Data Community Discord  https://gretel.ai/discord