Trace
Compare clause text with NLP similarity measures to identify countries that play an influential role in language dissemination.
POLICY INTELLIGENCE · TREATY RESEARCH
A research program that turns a global corpus of bilateral investment treaties into evidence: where treaty language travels, which country connections may emerge, and how a tailored LLM can help teams investigate, compare and draft with care.
01 / THE DECISION
Bilateral investment treaties are not isolated documents. Clauses are adapted, replicated and negotiated across countries and decades — shaping the policy space available to governments.
This program makes those patterns visible, then extends the analysis with a bespoke language-model interface that keeps the underlying treaty text in view. The aim is not an opaque answer, but a faster route from question to reviewable evidence.
02 / THE SYSTEM
Three complementary ways to understand treaty language and its possible next moves.
Compare clause text with NLP similarity measures to identify countries that play an influential role in language dissemination.
Represent countries and BITs as a network, combining topology and economic features to estimate likely future connections.
Use tailored prompts and a treaty vector database so the LLM can analyse clause stance, compare precedents and support grounded drafting.
Return evidence, citations and drafting alternatives to expert users, with human judgement remaining central to every conclusion.
03 / EVIDENCE
The research starts with published NLP and graph-learning methods, then builds toward a more flexible, context-aware alternative to purely rule-based treaty analysis.
BITs analysed for health-safeguard clause dissemination
countries represented in the BIT network analysis
XGBoost link-prediction accuracy in the published study
Published studies use EDIT treaty data. Reported performance applies to the study design and data; models should be revalidated before any policy or operational use.
04 / RESEARCH FOUNDATION
The first study quantifies national influence in clause dissemination. The second predicts potential BIT formation using network and country-level signals. Together they provide the foundation for a customised LLM system that brings retrieval, stance analysis and research assistance into one evidence-grounded workflow.
PLOS ONE · 2024
Introduced the Influence (INF) metric using text similarity to identify countries central to the diffusion of defensive, neutral and offensive health-related treaty language.
JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE · 2025
Combined country-network structure, node embeddings and economic indicators to predict links representing potential future BITs with health inclusion.
DATA FOUNDATION
A comprehensive, machine-readable full-text corpus of international investment agreements that enables retrieval, comparison and reproducible analysis.
05 / LLM WORKBENCH
A treaty-specialised LLM is designed to retrieve relevant provisions before it responds. It can support nuanced stance or sentiment analysis, compare drafting patterns, generate bounded alternatives and help researchers explore potential policy implications.
06 / PROJECT PRINCIPLE
Treaty analysis becomes more powerful when every answer remains traceable.Discuss a policy AI project ↗