NIST

soundex

(algorithm)

Definition: An algorithm to code surnames phonetically by reducing them to the first letter and up to three digits, where each digit is one of six consonant sounds. This reduces matching problems from different spellings.

Generalization (I am a kind of ...)
phonetic coding algorithm.

See also double metaphone, Jaro-Winkler, Caverphone, NYSIIS, Levenshtein distance.

Note: The algorithm was devised to code names in US census records. The standard algorithm works best on European names. Variants have been devised for names from other cultures.

Author: PEB

Implementation

(C). Apache codec implementations of soundex, Metaphone, and Double Metaphone (Java). (Visual Basic).

More information

Overview of Soundex.


Go to the Dictionary of Algorithms and Data Structures home page.

If you have suggestions, corrections, or comments, please get in touch with Paul Black.

Entry modified 22 November 2021.
HTML page formatted Mon Nov 22 06:32:06 2021.

Cite this as:
Paul E. Black, "soundex", in Dictionary of Algorithms and Data Structures [online], Paul E. Black, ed. 22 November 2021. (accessed TODAY) Available from: https://www.nist.gov/dads/HTML/soundex.html