LLM Pipeline for Legal Coding
I designed and evaluated a two-stage LLM pipeline to support the coding of legal texts on freedom of expression. The goal was not full automation, but building a system that meaningfully reduces the burden on human researchers.
- Processed a global corpus of 160+ laws across 60+ countries
- Built a recall–verification pipeline combining GPT models, retrieval, and structured outputs
- Showed tradeoffs between model performance, cost, and reliability in real-world workflows