Pilot Testing of an Information Extraction (IE) Prototype for Legal Research

Thumbnail Image

Date

2020-06-30

Authors

Scholtz, Brenda
Padayachy, Thashen
Adewoyin, Oluwande

Journal Title

Journal ISSN

Volume Title

Publisher

LINK Centre, University of the Witwatersrand (Wits), Johannesburg

Abstract

This article presents findings from pilot testing of elements of an information extraction (IE) prototype designed to assist legal researchers in engaging with case law databases. The prototype that was piloted seeks to extract, from legal case documents, relevant and accurate information on cases referred to (CRTs) in the source cases. Testing of CRT extraction from 50 source cases resulted in only 38% (n = 19) of the extractions providing an accurate number of CRTs. In respect of the prototype’s extraction of CRT attributes (case title, date, journal, and action), none of the 50 extractions produced fully accurate attribute information. The article outlines the prototype, the pilot testing process, and the test findings, and then concludes with a discussion of where the prototype needs to be improved.

Description

Keywords

information retrieval (IR), information extraction (IE), natural language processing (NLP), legal cases, document databases, source cases, cases referred to (CRTs)

Citation

Scholtz, B., Padayachy, T., & Adewoyin, O. (2020). Pilot testing of an information extraction (IE) prototype for legal research.The African Journal of Information and Communication (AJIC), 25, 1-20. https://doi.org/10.23962/10539/29192

Endorsement

Review

Supplemented By

Referenced By