eTBLAST
A major contributor to this article appears to have a close connection with its subject. (August 2011) |
eTBLAST was a free text-similarity service now defunct. It was initially developed by Alexander Pertsemlidis and Harold “Skip” Garner in 2005 at The University of Texas Southwestern Medical Center. It offered access to the following databases:
- MEDLINE
- National Institutes of Health (NIH)
- CRISP
- Institute of Physics (IOP)
- Wikipedia
- arXiv
- NASA technical reports
- Virginia Tech class descriptions
- others of clinical interest
eTBLAST searched citation databases[1] and databases containing full-text such as PUBMED. It compared a user’s natural-text query with target databases utilizing a hybrid-search algorithm. The algorithm consisted of a low-sensitivity, weighted, keyword-based first pass followed by a novel second pass based on sentence alignment. eTBLAST later became a web-based service of The Innovation Laboratory at the Virginia Bioinformatics Institute.
The text-similarity engine studied duplicate publications and potential plagiarism in biomedical literature. eTBLAST received thousands of random samples of Medline abstracts for a large-scale study. Those with the highest similarity were assessed then entered into an on-line database. The work revealed several trends including an increasing rate of duplication in the biomedical literature, according to prominent scientific journals Bioinformatics,[2]Anaesthesia and Intensive Care,[3] Clinical Chemistry,[4] Urologic oncology,[5] Nature,[6] and Science.[7]
See also
[edit]- BLAST (Basic Local Alignment Search Tool)
- Natural language processing
- Medical literature retrieval
References
[edit]- ^ Lewis, J; Ossowski, S; Hicks, J; Errami, M; Garner, HR (2006). "Text similarity: An alternative way to search MEDLINE". Bioinformatics. 22 (18): 2298–304. doi:10.1093/bioinformatics/btl388. PMID 16926219.
- ^ Errami, M; Hicks, JM; Fisher, W; Trusty, D; Wren, JD; Long, TC; Garner, HR (2007). "Deja vu a study of duplicate citations in Medline". Bioinformatics. 24 (2): 243–9. doi:10.1093/bioinformatics/btm574. PMID 18056062.
- ^ Loadsman, JA; Garner, HR; Drummond, GB (2008). "Towards the elimination of duplication in Anaesthesia and Intensive Care". Anaesthesia and Intensive Care. 36 (5): 643–5. doi:10.1177/0310057X0803600502. PMID 18853580.
- ^ George, AC; Long, TC; Garner, HR (2010). "Quaere Verum". Clinical Chemistry. 56 (4): 673–4. doi:10.1373/clinchem.2009.130468. PMID 20093558.
- ^ Garner, HR (2011). "Combating unethical publications with plagiarism detection services". Urologic Oncology. 29 (1): 95–9. doi:10.1016/j.urolonc.2010.09.016. PMC 3035174. PMID 21194644.
- ^ Errami, M; Garner, H (2008). "A tale of two citations". Nature. 451 (7177): 397–9. Bibcode:2008Natur.451..397E. doi:10.1038/451397a. PMID 18216832. S2CID 4358525.
- ^ Long, TC; Errami, M; George, AC; Sun, Z; Garner, HR (2009). "Responding to Possible Plagiarism". Science. 323 (5919): 1293–4. doi:10.1126/science.1167408. PMID 19265004. S2CID 28467385.