Journal article

Talisman: a JavaScript archive of fuzzy matching, information retrieval and record linkage building blocks

Abstract: Information retrieval and record linkage have always relied on crafty, heuristic routines aimed at implementing what is often called fuzzy matching. Indeed, even if fuzzy logic feels natural to humans, one needs to find various strategies to coerce computers into acknowledging that strings, for instance, are not always strictly delimited. But while some of those techniques, such as the Soundex phonetic algorithm invented at the beginning of the 20th century, are still well known and used, many of them were unfortunately lost to time. As such, the Talisman JavaScript library aims to be an archive of a wide variety of techniques that have been used throughout the history of computer science to perform fuzzy comparisons between words, names, sentences, etc. Thus, even if Talisman obviously provides state-of-the-art functions that are still being used in an industrial context, it also aims to be a safe harbor for lesser-known or clunkier techniques, for historical and archival purposes.
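To make the idea of fuzzy matching concrete, here is a minimal sketch of the American Soundex algorithm mentioned in the abstract. It is an illustration written for this record, not Talisman's own implementation; the function name `soundex` and the `CODES` table are assumptions chosen for the example.

```javascript
// Illustrative American Soundex sketch; NOT taken from the Talisman source.
// Consonants that sound alike share a digit.
const CODES = {
  B: '1', F: '1', P: '1', V: '1',
  C: '2', G: '2', J: '2', K: '2', Q: '2', S: '2', X: '2', Z: '2',
  D: '3', T: '3',
  L: '4',
  M: '5', N: '5',
  R: '6'
};

function soundex(name) {
  // Keep ASCII letters only and normalize case.
  const letters = name.toUpperCase().replace(/[^A-Z]/g, '');
  if (letters.length === 0) return '';

  // The first letter is kept as-is; its digit still counts for deduplication.
  let code = letters[0];
  let previous = CODES[letters[0]] || '';

  for (let i = 1; i < letters.length && code.length < 4; i++) {
    const letter = letters[i];

    // H and W are skipped and do not separate two identical digits.
    if (letter === 'H' || letter === 'W') continue;

    // Vowels (and Y) have no digit; they reset the previous digit.
    const digit = CODES[letter] || '';
    if (digit !== '' && digit !== previous) code += digit;
    previous = digit;
  }

  // Pad with zeros to the fixed length of four characters.
  return code.padEnd(4, '0');
}

console.log(soundex('Robert'));   // 'R163'
console.log(soundex('Rupert'));   // 'R163'
console.log(soundex('Ashcraft')); // 'A261'
```

Two names that are spelled differently but sound alike, such as "Robert" and "Rupert", receive the same code, which is what lets a strict string comparison behave like a fuzzy one.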

https://hal-sciencespo.archives-ouvertes.fr/hal-03237144
Contributor: Spire Sciences Po Institutional Repository
Submitted on: Wednesday, May 26, 2021 - 15:23:51
Last modified on: Sunday, July 4, 2021 - 03:24:55


Citation

Guillaume Plique. Talisman: a JavaScript archive of fuzzy matching, information retrieval and record linkage building blocks. Journal of Open Source Software, Open Journals, 2020, pp.1 - 2. ⟨10.21105/joss.02405⟩. ⟨hal-03237144⟩
