NOTICE: Due to a lapse in annual appropriations, most of this website is not being updated. Learn more.
Form submissions will still be accepted but will not receive responses at this time. Sections of this site for programs using non-appropriated funds (such as NVLAP) or those that are excepted from the shutdown (such as CHIPS and NVD) will continue to be updated.
An official website of the United States government
Here’s how you know
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
Secure .gov websites use HTTPS
A lock (
) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.
Jacob Collard, Valeria de Paiva, Brendan Fong, Eswaran Subrahmanian
Abstract
We investigate some different systems for extracting mathematical entities from texts in the mathematical field of category theory, as a first step for constructing a mathematical knowledge graph. We consider four different term extractors and compare their results. This small experiment showcases some of the issues with the construction and evaluation of specific domain knowledge graphs. We also make available two open corpora in research mathematics, in particular in category theory: a small corpus of 755 abstracts from the journal TAC (3188 sentences), and a bigger corpus from the nLab community wiki (15,000 sentences).
Proceedings Title
Proceedings of the 29th International Conference on Computational Linguistics
Conference Dates
October 12-17, 2022
Conference Location
Gyeongju, KR
Conference Title
29th International Conference on Computational Linguistics
Collard, J.
, de Paiva, V.
, Fong, B.
and Subrahmanian, E.
(2022),
Extracting Mathematical Concepts from Text, Proceedings of the 29th International Conference on Computational Linguistics, Gyeongju, KR, [online], https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=934895
(Accessed October 9, 2025)