Workshop
Pre-conference workshop on Monday 8 June.
Free-text healthcare data and Trusted Research Environments (TREs)
DARE UK (Data and Analytics Research Environments UK) has recently awarded three project catalysts to explore next-generation capabilities for the use of free-text data within Trusted Research Environments (TREs). These projects aim to develop early-stage prototypes that test new ideas to support secure data research and analytics, focusing on healthcare. Additionally, DARE UK also funds a community working group called SAFETEXT, which will develop guidelines and protocols for safe and responsible use of de-identified and synthetic healthcare free-text data in AI development.
In this full-day workshop, the project representatives will present how the needs and opportunities for the healthcare data research community and public have shaped the work. The workshop will also provide an opportunity for the wider community to feedback on the outcomes, and outline future collaboration and expectations. The detailed agenda will follow shortly.
Projects involved

STAR-TRE (Safe and Trustworthy Assessment of Risk in TREs for Sensitive Free-Text Access) will address one of the most significant gaps in secure data research: the safe use of sensitive free-text data, such as clinical notes and social care records. By developing scalable, language-model-enabled tools and transparent risk assessment methods, the project aims to help researchers and data custodians understand when and how free-text can be used responsibly — without undermining privacy or public trust.

FORTRESS-TeHR (Federated, Open and Reliable TREs for Synthetic Textual Healthcare Records) will explore how synthetic clinical text can be generated and validated for safe use in TREs. Combining differential privacy with strong public and regulatory engagement, the project will test whether synthetic free-text data can meaningfully support research and federated learning while reducing privacy risks.

TRExt (TRE Text Analytics) will explore approaches to enable TREs to convert sensitive unstructured text into structured, analysable formats suitable for federated analytics. Using only anonymised and synthetic data, the project will build reusable pipelines that allow researchers to analyse text safely, opening up new possibilities across health, justice, and social research.

SAFETEXT (Community-led Protocols for the Safe and Responsible Use of De-identified and Synthetic Healthcare Text for AI Development) is a working group that will establish best practices to ensure trust and transparency while developing AI technologies that rely on learning from de-identified and synthetic healthcare free-text data.
Workshop organisers
- Franz Gruber, University of Edinburgh
- Goran Nenadic, University of Manchester
- Yamiko Msosa, King’s College London