AI Case Study
LovData tests automatic redaction of data in court case documents using machine learning and natural language processing
LovData is implementing AI methods to sort through public Norwegian court documents and ensure names have been redacted where necessary, in accordance with Norwegian law.
Industry
Professional Services
Legal
Project Overview
"Our main solution used today is an in-house solution written in Fortran back in 2003. Until that time, anonymisation was a 100% manual job. We will replace all Fortran-solutions with new in-house Java solutions [and] we are testing different AI technologies, which might help us with this. Our current solution, based on lists with names, is challenged when it comes to picking up all kinds of misspellings of names and names that are not already added to our lists. Our lists include all citizens registered in Norway. However, we receive an increasing numbers of court decision involving non-Norwegians, and it is laborious to find all these names in court decisions. For a while we have been experimenting with NER software to extract names not recognised by our current in-house solution. Recent test have been promising, but due to our high quality demands we are not quite ready to implement this solution. [Also] being a small language, none of these models are pre-trained to understand Norwegian. Having to train the models before testing them has been a time consuming task."
Reported Results
Research; results currently unavailable
Technology
Function
Information Technology
Data Management
Background
"LovData is a private foundation created in 1981 by the Ministry of Justice and the Faculty of Law in Oslo and has been anonymising court cases involving minors since the beginning. Court decisions are public information per se. However, for criminal cases, cases involving minors, family cases involving married or divorced parties and disabled persons, we have statutes that prohibit us from publishing the court decisions un-anonymised. In addition, Norwegian statutes on privacy prohibit us in general from publishing decisions with sensitive personal data un-anonymised. If the court consider it impossible to publish a court decision without identifying the involved parties, it will prohibit us from publishing it."
Benefits
Data
Legal court documents