[PLUG] Current state of Linux voice recognition

Larry Brigman larry.brigman at gmail.com
Wed Jun 28 19:30:48 UTC 2017


Actually it is fact. Amateur radio operators and radio to digital voice use
a 2k wide bandwidth filter.
IRLP project uses 8khz sampling.

On Jun 28, 2017 8:51 AM, "Richard Owlett" <rowlett at cloud85.net> wrote:

> On 06/28/2017 09:54 AM, Larry Brigman wrote:
> > Not really answering your question but providing important facts
> > about the domain.
> > Human voice frequency range tops out at 8khz.  Normal speech is around
> > 2-3khz.  Any soundcard that samples at 48khz will work just fine.
>
> <chuckle> That's the theory that's been around "forever".
> I'm in possession of a factoid that prompts me to do some research
> needing high resolution at high sample rates.
>
> For more than 60 years I've had significant high frequency hearing loss
> in one ear due to repeated ear infections in childhood. Forty years ago
> I consulted a major medical center in Boston at the prompting of a RN
> friend. The formal diagnosis was "nerve deafness" based on comparing my
> tests with headphones and a "bone conduction" setup. I personally
> suspect there are also complications attributable to scar tissue on the
> eardrum.
>
> I date from the era when all males attending "land grant institutions"
> were required to take two years of ROTC. I planned to take Advanced
> ROTC. I barely passed the physical due to my hearing loss. I have
> *personal* knowledge that 'high frequency' components are important as I
> must use my "good ear" when expecting to understand female speakers.
>
> If I recall my terminology [and spelling ;] correctly, consider
> "sibilants" and "fricatives" [possibly also "full stops"].
>
>
> >
> > On Wed, Jun 28, 2017 at 6:52 AM, Richard Owlett <rowlett at cloud85.net>
> wrote:
> >
> >> Up until about ten years ago, while still using Windows, I was following
> >> voice recognition. At that time the only option was commercial product
> >> which cost too much and wasn't a good match for my desires at that time.
> >>
> >> Time has passed and I'm retired. What I'm looking for would be a large
> >> vocabulary, single speaker, continuous speech system. The application
> >> would be straight text note taking - I'm a slow and lousy typist.
> >>
> >> I'm already investigating good microphones with good A/D resolution and
> >> preferably high sample rate [I've ideas on pre-processing I would like
> >> to experiment with].
> >>
> >> Can anyone recommend some survey articles &/or competent current
> reviews.
> >> TIA
>
>
>
>
> _______________________________________________
> PLUG mailing list
> PLUG at lists.pdxlinux.org
> http://lists.pdxlinux.org/mailman/listinfo/plug
>



More information about the PLUG mailing list