Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Requirements Analysis of Large Policy Corpora



Alden A. Dima, Aaron Massey


Regulators, policy makers, and consumers are interested in proactively identifying services with acceptable or compliant data use policies, privacy policies, and terms of service. Academic requirements engineering researchers and legal scholars have developed qualitative, manual approaches to conducting requirements analysis of policy documents to identify concerns and compare services against preferences or standards. In this research, we develop and present an approach to conducting large-scale, qualitative, prospective analyses of policy documents with respect to the wide-variety of normative concerns found in policy documents. Our approach uses techniques from natural language processing, including topic modeling and summarization. We evaluate our approach in an exploratory case study that attempts to replicate a manual legal analysis of roughly 200 privacy policies from seven domains in a semi-automated fashion at a larger scale. Our findings suggest that this approach is promising for some concerns.
Proceedings Title
54th Hawaii International Conference on System Sciences, HICSS 2021
Conference Dates
January 5-8, 2021
Conference Location
Kauai, HI, US


text analysis, large document collections, privacy policies


Dima, A. and Massey, A. (2021), Requirements Analysis of Large Policy Corpora, 54th Hawaii International Conference on System Sciences, HICSS 2021, Kauai, HI, US, [online],, (Accessed April 17, 2024)
Created January 5, 2021, Updated March 31, 2022