Chin P. Cheng, Jiayi Pan, Gloria T. Lau, Kincho H. Law, Albert W. Jones
Increasingly, taxonomies are being developed for a wide variety of industrial domains and specific applications within those domains. These taxonomies attempt to represent formally the vocabularies commonly used by domain practitioners. These formal representations have the potential to automate information retrieval and improve decision-making. Those decisions must comply with existing government regulations and codes of practice, which are not always known to the practitioners. Although both are available in digital form online, practitioners cannot retrieve easily relevant regulations and codes that apply to particular decisions. To address this problem, we propose an approach to relate regulations with existing industry-specific taxonomies. The mapping from a single taxonomy to a single regulation is a trivial keyword matching task. In this paper, we examine techniques to map a single taxonomy to multiple regulations, as well as to map multiple taxonomies to a single regulation. Those techniques include Cosine similarity, Jaccard coefficient and market-basket analysis. These techniques provide a metric that measures the similarity between concepts from different taxonomies. We describe these techniques and metrics, and evaluate them using examples from the building industry. These examples show the potential regulatory benefits from the mapping between various taxonomies and regulations.
Proceedings of the 9th Annual International Conference on Digital Government Research