Automatic Speech Recognition Variance: Consecutive Runs of Low-Resource Languages in Whisper

Home > Archive > 2024 > Volume 14 Number 2 (2024) >

IJML 2024 Vol.14(2): 43-47
DOI: 10.18178/ijml.2024.14.2.1156

Laurel Lord* and Mark Newman

Department of Data Sciences, Harrisburg University of Science and Technology, Harrisburg, USA
Email: lalord@my.harrisburgu.edu (L.L.); mnewman@harrisburgu.edu (M.N.)
^*Corresponding author

Manuscript received September 15, 2023; revised November 26, 2023; accepted January 19, 2024; published April 26, 2024

Abstract—This study employs OpenAI’s Whisper to explore the manifestation of variance in an Automatic Speech Recognition (ASR) system. Three trained languages from Whisper’s current offerings (English, French, and Haitian Kreyòl) and one untrained (Saint Lucian Kwéyòl) completed thirty consecutive runs each, across five model sizes. Etymologically complex yet orthographically simple, mutually intelligible languages may challenge ASR system capabilities. However, a phonetically similar trained language model generated approximate phonetic transcripts for an untrained one. Despite implicit variance hurdles like non-determinism and data deficiencies, ASR systems may aid in documenting high-orality, low-resource languages.

Keywords—automatic speech recognition, creole, low-resource languages, Whisper

[PDF]

Cite: Laurel Lord and Mark Newman, "Automatic Speech Recognition Variance: Consecutive Runs of Low-Resource Languages in Whisper," International Journal of Machine Learning vol. 14, no. 2, pp. 43-47, 2024.

Copyright © 2024 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

PREVIOUS PAPER

Automatic Path Generation of USV Using Reinforcement Learning for Complex Coastal Areas

NEXT PAPER

Machine Learning Based Cancer Classification Using Gene Expression Data

General Information

E-ISSN: 2972-368X
Abbreviated Title: Int. J. Mach. Learn.
Frequency: Quarterly
DOI: 10.18178/IJML
Editor-in-Chief: Dr. Lin Huang
Executive Editor: Ms. Cherry L. Chen
Abstracing/Indexing: Google Scholar, Crossref, ProQuest, Electronic Journals Library, CNKI.
E-mail: editor@ijml.org
APC: 500USD

Home

About IJML

Editorial Board

Author Guideline

Editor Guideline

Reviewer Guideline

Special Issues

Archive

Home > Archive > 2024 > Volume 14 Number 2 (2024) >

Automatic Speech Recognition Variance: Consecutive Runs of Low-Resource Languages in Whisper

General Information

Article Metrics in Dimensions