Preprint, Working Paper. Year: 2023

Mining tortured abbreviations from the scientific literature

Abstract

The 'Problematic Paper Screener' (PPS, WCRI'22, https://doi.org/10.48550/arXiv.2210.04895) supports the human reassessment of scientific articles flagged as suspicious. The 'tortured detector' tabulates 12k papers containing tortured phrases: established scientific concepts paraphrased with synonyms, such as 'butt-centric waterway' for 'anal canal.' Even abbreviations can be tortured, as in 'Convolutional Brain Organisation (CNN)' for 'Convolutional Neural Network (CNN).' This abstract tackles the following task: discover all abbreviations in any given article and classify each one as tortured or genuine.
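
To make the task concrete, the following is a minimal sketch, not the authors' method. It assumes that abbreviations are introduced as 'Long Form (ABBR)' and that a genuine long form has word initials spelling the abbreviation. The names `find_abbreviations` and `classify`, and the labels 'genuine' and 'suspect', are hypothetical and chosen for illustration only.

```python
import re

# Assumption: an abbreviation is 2 to 6 capital letters in parentheses.
ABBR = re.compile(r"\(([A-Z]{2,6})\)")


def find_abbreviations(text):
    """Return (long_form, abbreviation) pairs found in text.

    The candidate long form is simply the N words preceding the
    parenthesis, where N is the length of the abbreviation.
    """
    pairs = []
    for match in ABBR.finditer(text):
        abbr = match.group(1)
        words = re.findall(r"[A-Za-z]+", text[:match.start()])
        candidate = words[-len(abbr):]
        if len(candidate) == len(abbr):
            pairs.append((" ".join(candidate), abbr))
    return pairs


def classify(long_form, abbreviation):
    """Label a pair 'genuine' if the word initials spell the abbreviation.

    Any mismatch is only labelled 'suspect': it is a signal for human
    review, not proof that the long form is tortured.
    """
    initials = "".join(word[0] for word in long_form.split()).upper()
    return "genuine" if initials == abbreviation else "suspect"


text = ("We trained a Convolutional Brain Organisation (CNN) "
        "and a Convolutional Neural Network (CNN).")
for long_form, abbr in find_abbreviations(text):
    print(abbr, "|", long_form, "|", classify(long_form, abbr))
```

On the two examples from the abstract, this heuristic marks 'Convolutional Brain Organisation (CNN)' as suspect and 'Convolutional Neural Network (CNN)' as genuine. It would also mislabel legitimate abbreviations whose letters are not plain word initials, so it is best read as a first filter ahead of human reassessment.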
Main file
Tortured_abbreviations.pdf (1.89 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-04311600, version 1 (28-11-2023)
hal-04311600, version 2 (24-09-2024)
hal-04311600, version 3 (06-11-2024)

Identifiers

  • HAL Id: hal-04311600, version 2

Cite

Alexandre Clausse, Guillaume Cabanac, Pascal Cuxac, Cyril Labbé. Mining tortured abbreviations from the scientific literature. 2023. ⟨hal-04311600v2⟩

Collections

UGA LIG_GLSI_SIGMA
413 views
117 downloads
