Synonym extraction and abbreviation expansion with ensembles of semantic spaces, Journal of Biomedical Semantics
By a mysterious writer
Last updated 13 November 2024
Background Terminologies that account for variation in language use by linking synonyms and abbreviations to their corresponding concept are important enablers of high-quality information extraction from medical texts. Due to the use of specialized sub-languages in the medical domain, manual construction of semantic resources that accurately reflect language use is both costly and challenging, often resulting in low coverage. Although models of distributional semantics applied to large corpora provide a potential means of supporting development of such resources, their ability to isolate synonymy from other semantic relations is limited. Their application in the clinical domain has also only recently begun to be explored. Combining distributional models and applying them to different types of corpora may lead to enhanced performance on the tasks of automatically extracting synonyms and abbreviation-expansion pairs.
Results A combination of two distributional models (Random Indexing and Random Permutation) employed in conjunction with a single corpus outperforms using either of the models in isolation. Furthermore, combining semantic spaces induced from different types of corpora (a corpus of clinical text and a corpus of medical journal articles) further improves results, outperforming a combination of semantic spaces induced from a single source, as well as a single semantic space induced from the conjoint corpus. A combination strategy that simply sums the cosine similarity scores of candidate terms is generally the most profitable of those explored. Finally, applying simple post-processing filtering rules yields substantial performance gains on the tasks of extracting abbreviation-expansion pairs, but not synonyms. The best results, measured as recall in a list of ten candidate terms, for the three tasks are: 0.39 for abbreviations to long forms, 0.33 for long forms to abbreviations, and 0.47 for synonyms.
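The summing strategy and the recall measure described in the results can be illustrated with a minimal sketch. It is not the authors' implementation: the vectors below are hand-made toy values rather than output of Random Indexing or Random Permutation, the function names (`cosine`, `rank_candidates`, `recall_at_k`) are hypothetical, and restricting candidates to terms present in every space is an assumption made only to keep the example short.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_candidates(query, spaces, k=10):
    """Rank candidate terms by the SUM of their cosine similarity to the
    query across all semantic spaces, and return the top k terms.

    Assumption: only terms that occur in every space are considered.
    """
    shared = set.intersection(*(set(space) for space in spaces))
    if query not in shared:
        return []
    scores = {
        term: sum(cosine(space[query], space[term]) for space in spaces)
        for term in shared
        if term != query
    }
    # Highest summed score first; ties broken alphabetically for determinism.
    return sorted(scores, key=lambda term: (-scores[term], term))[:k]


def recall_at_k(ranked, reference):
    """Fraction of reference terms that appear in the ranked candidate list."""
    reference = set(reference)
    if not reference:
        return 0.0
    return len(set(ranked) & reference) / len(reference)


# Toy semantic spaces (term -> vector). The values are invented for
# illustration; the two spaces may have different dimensionality because
# similarities are computed within each space before being summed.
clinical_space = {
    "mi": [1.0, 0.0, 0.2],
    "myocardial infarction": [0.9, 0.1, 0.3],
    "heart attack": [0.2, 0.9, 0.1],
    "aspirin": [0.0, 0.1, 1.0],
}
journal_space = {
    "mi": [0.5, 0.5],
    "myocardial infarction": [0.6, 0.4],
    "heart attack": [0.5, 0.6],
    "aspirin": [1.0, -1.0],
}

ranked = rank_candidates("mi", [clinical_space, journal_space], k=10)
print(ranked)
print(recall_at_k(ranked, {"myocardial infarction"}))
```

In this toy data, "heart attack" scores low in the clinical space but high in the journal space, so its summed score places it above "aspirin"; this is the kind of complementary evidence an ensemble of spaces is meant to exploit.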
Conclusions This study demonstrates that ensembles of semantic spaces can yield improved performance on the tasks of automatically extracting synonyms and abbreviation-expansion pairs. This notion, which merits further exploration, allows different distributional models – with different model parameters – and different types of corpora to be combined, potentially allowing enhanced performance to be obtained on a wide range of natural language processing tasks.