Member-only story

Web Speech API: What Works, What Doesn’t, and How to Improve It by Linking It to a GPT Language Model

Part of a series on how modern AI and other technologies could assist more efficient human-computer interactions

LucianoSphere (Luciano Abriata, PhD)
TDS Archive
15 min readDec 6, 2023

--

Photo by palesa on Unsplash

I am of the idea that modern technologies enable today much simpler and natural human-computer interactions than what current software actually proposes. Indeed, I think technologies are ripe enough that we could just go without traditional interfaces and move forward with a revolution in user experience.

Large language models have certainly triggered one stage of this revolution, particularly in how we ask for information. However, I think technologies can still provide much more. For example, we are still largely stuck with flat screens despite the decreasing costs of VR headsets; we are still using mouse, keyboard, and touch gestures to operate devices despite the level of advancement of technologies like eye-gazing, speech-recognition and body limb tracking; we are still reading out a lot despite great advances in speech synthesis.

I feel current technologies are ripe enough to offer human-computer interactions almost like those in Star Trek (if you don’t know…

--

--

TDS Archive
TDS Archive

Published in TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

LucianoSphere (Luciano Abriata, PhD)
LucianoSphere (Luciano Abriata, PhD)

Written by LucianoSphere (Luciano Abriata, PhD)

https://www.lucianoabriata.com | Scientific writing, technology integrator, programming, biotech, bioinformatics.| Have a job for me? Contact me in ES FR EN IT

Responses (7)