AI Case Study

LovData tests automatic redaction of data in court case documents using machine learning and natural language processing

LovData is implementing AI methods to sort through public Norwegian court documents and ensure names have been redacted where necessary, in accordance with Norwegian law.

Industry

Professional Services

Legal

Project Overview

"Our main solution used today is an in-house solution written in Fortran back in 2003. Until that time, anonymisation was a 100% manual job. We will replace all Fortran solutions with new in-house Java solutions [and] we are testing different AI technologies, which might help us with this. Our current solution, based on lists with names, is challenged when it comes to picking up all kinds of misspellings of names and names that are not already added to our lists. Our lists include all citizens registered in Norway. However, we receive an increasing number of court decisions involving non-Norwegians, and it is laborious to find all these names in court decisions. For a while we have been experimenting with NER software to extract names not recognised by our current in-house solution. Recent tests have been promising, but due to our high quality demands we are not quite ready to implement this solution. [Also] being a small language, none of these models are pre-trained to understand Norwegian. Having to train the models before testing them has been a time-consuming task."
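
The limitation described in the quote above can be illustrated with a minimal sketch. It is not LovData's implementation (their current system is reported to be written in Fortran); it only assumes a list-based lookup extended with approximate string matching from Python's standard library. The name list, the function name redact_names, the cutoff of 0.85 and the placeholder text are all hypothetical choices made for illustration.

```python
import difflib
import re

# Hypothetical, tiny name list. The real lists are described as covering
# all citizens registered in Norway.
KNOWN_NAMES = ["Hansen", "Johansen", "Olsen", "Kristiansen"]


def redact_names(text, known_names=KNOWN_NAMES, cutoff=0.85,
                 placeholder="[REDACTED]"):
    """Replace capitalised words that match, or nearly match, a listed name."""

    def replace(match):
        token = match.group(0)
        # get_close_matches returns listed names whose similarity ratio
        # to the token is at least `cutoff` (1.0 means identical).
        if difflib.get_close_matches(token, known_names, n=1, cutoff=cutoff):
            return placeholder
        return token

    # Candidate tokens: capitalised words, including Norwegian letters.
    # This pattern is an assumption and will miss many real name forms.
    return re.sub(r"\b[A-ZÆØÅ][a-zæøåé]+\b", replace, text)


if __name__ == "__main__":
    print(redact_names("Vitnet Johansen møtte Olsen."))
    print(redact_names("Tiltalte Johanssen forklarte seg."))
    print(redact_names("Vitnet Kowalski forklarte seg."))
```

With this cutoff, a misspelling such as "Johanssen" is still close enough to the listed "Johansen" to be redacted, whereas an unlisted name such as "Kowalski" passes through untouched. That second case is the gap the quote attributes to list-based matching, and the one NER software is being tested to close.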

Reported Results

Research; results currently unavailable

Technology

Function

Information Technology

Data Management

Background

"LovData is a private foundation created in 1981 by the Ministry of Justice and the Faculty of Law in Oslo and has been anonymising court cases involving minors since the beginning. Court decisions are public information per se. However, for criminal cases, cases involving minors, family cases involving married or divorced parties and disabled persons, we have statutes that prohibit us from publishing the court decisions un-anonymised. In addition, Norwegian statutes on privacy prohibit us in general from publishing decisions with sensitive personal data un-anonymised. If the court considers it impossible to publish a court decision without identifying the involved parties, it will prohibit us from publishing it."

Benefits

Data

Legal court documents