Cargando…

Semi-automated fact-checking of nucleotide sequence reagents in biomedical research publications: The Seek & Blastn tool

Nucleotide sequence reagents are verifiable experimental reagents in biomedical publications, because their sequence identities can be independently verified and compared with associated text descriptors. We have previously reported that incorrectly identified nucleotide sequence reagents are charac...

Descripción completa

Detalles Bibliográficos
Autores principales: Labbé, Cyril, Grima, Natalie, Gautier, Thierry, Favier, Bertrand, Byrne, Jennifer A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6396917/
https://www.ncbi.nlm.nih.gov/pubmed/30822319
http://dx.doi.org/10.1371/journal.pone.0213266
_version_ 1783399341479690240
author Labbé, Cyril
Grima, Natalie
Gautier, Thierry
Favier, Bertrand
Byrne, Jennifer A.
author_facet Labbé, Cyril
Grima, Natalie
Gautier, Thierry
Favier, Bertrand
Byrne, Jennifer A.
author_sort Labbé, Cyril
collection PubMed
description Nucleotide sequence reagents are verifiable experimental reagents in biomedical publications, because their sequence identities can be independently verified and compared with associated text descriptors. We have previously reported that incorrectly identified nucleotide sequence reagents are characteristic of highly similar human gene knockdown studies, some of which have been retracted from the literature on account of possible research fraud. Because of the throughput limitations of manual verification of nucleotide sequences, we developed a semi-automated fact checking tool, Seek & Blastn, to verify the targeting or non-targeting status of published nucleotide sequence reagents. From previously described and unknown corpora of 48 and 155 publications, respectively, Seek & Blastn correctly extracted 304/342 (88.9%) and 1066/1522 (70.0%) nucleotide sequences and a predicted targeting/ non-targeting status. Seek & Blastn correctly predicted the targeting/ non-targeting status of 293/304 (96.4%) and 988/1066 (92.7%) of the correctly extracted nucleotide sequences. A total of 38/39 (97.4%) or 31/79 (39.2%) Seek & Blastn predictions of incorrect nucleotide sequence reagent use were correct in the two literature corpora. Combined Seek & Blastn and manual analyses identified a list of 91 misidentified nucleotide sequence reagents, which could be built upon through future studies. In summary, incorrect nucleotide sequence reagents represent an under-recognized source of error within the biomedical literature, and fact checking tools such as Seek & Blastn may help to identify papers and manuscripts affected by these errors.
format Online
Article
Text
id pubmed-6396917
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-63969172019-03-08 Semi-automated fact-checking of nucleotide sequence reagents in biomedical research publications: The Seek & Blastn tool Labbé, Cyril Grima, Natalie Gautier, Thierry Favier, Bertrand Byrne, Jennifer A. PLoS One Research Article Nucleotide sequence reagents are verifiable experimental reagents in biomedical publications, because their sequence identities can be independently verified and compared with associated text descriptors. We have previously reported that incorrectly identified nucleotide sequence reagents are characteristic of highly similar human gene knockdown studies, some of which have been retracted from the literature on account of possible research fraud. Because of the throughput limitations of manual verification of nucleotide sequences, we developed a semi-automated fact checking tool, Seek & Blastn, to verify the targeting or non-targeting status of published nucleotide sequence reagents. From previously described and unknown corpora of 48 and 155 publications, respectively, Seek & Blastn correctly extracted 304/342 (88.9%) and 1066/1522 (70.0%) nucleotide sequences and a predicted targeting/ non-targeting status. Seek & Blastn correctly predicted the targeting/ non-targeting status of 293/304 (96.4%) and 988/1066 (92.7%) of the correctly extracted nucleotide sequences. A total of 38/39 (97.4%) or 31/79 (39.2%) Seek & Blastn predictions of incorrect nucleotide sequence reagent use were correct in the two literature corpora. Combined Seek & Blastn and manual analyses identified a list of 91 misidentified nucleotide sequence reagents, which could be built upon through future studies. In summary, incorrect nucleotide sequence reagents represent an under-recognized source of error within the biomedical literature, and fact checking tools such as Seek & Blastn may help to identify papers and manuscripts affected by these errors. Public Library of Science 2019-03-01 /pmc/articles/PMC6396917/ /pubmed/30822319 http://dx.doi.org/10.1371/journal.pone.0213266 Text en © 2019 Labbé et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Labbé, Cyril
Grima, Natalie
Gautier, Thierry
Favier, Bertrand
Byrne, Jennifer A.
Semi-automated fact-checking of nucleotide sequence reagents in biomedical research publications: The Seek & Blastn tool
title Semi-automated fact-checking of nucleotide sequence reagents in biomedical research publications: The Seek & Blastn tool
title_full Semi-automated fact-checking of nucleotide sequence reagents in biomedical research publications: The Seek & Blastn tool
title_fullStr Semi-automated fact-checking of nucleotide sequence reagents in biomedical research publications: The Seek & Blastn tool
title_full_unstemmed Semi-automated fact-checking of nucleotide sequence reagents in biomedical research publications: The Seek & Blastn tool
title_short Semi-automated fact-checking of nucleotide sequence reagents in biomedical research publications: The Seek & Blastn tool
title_sort semi-automated fact-checking of nucleotide sequence reagents in biomedical research publications: the seek & blastn tool
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6396917/
https://www.ncbi.nlm.nih.gov/pubmed/30822319
http://dx.doi.org/10.1371/journal.pone.0213266
work_keys_str_mv AT labbecyril semiautomatedfactcheckingofnucleotidesequencereagentsinbiomedicalresearchpublicationstheseekblastntool
AT grimanatalie semiautomatedfactcheckingofnucleotidesequencereagentsinbiomedicalresearchpublicationstheseekblastntool
AT gautierthierry semiautomatedfactcheckingofnucleotidesequencereagentsinbiomedicalresearchpublicationstheseekblastntool
AT favierbertrand semiautomatedfactcheckingofnucleotidesequencereagentsinbiomedicalresearchpublicationstheseekblastntool
AT byrnejennifera semiautomatedfactcheckingofnucleotidesequencereagentsinbiomedicalresearchpublicationstheseekblastntool