Using Azure Speech To Text Service where I'm giving input as memory stream but getting error "NOMATCH: Speech could not be recognized"

I'm using the Microsoft.CognitiveServices.Speech speech-to-text service where I'm giving input as a MemoryStream instead of file input, using a custom API. However, I get the error "NOMATCH: Speech could not be recognized". The code works when I'm using file input, where I read the file and give the input as a FileStream. Here is the code I'm using:

    /// <summary>
    /// Performs one-shot speech recognition over a raw PCM audio stream using the
    /// Azure Speech SDK. The stream is declared to the SDK as 16 kHz, 16-bit, mono
    /// PCM; audio that does not actually match this format typically produces a
    /// NOMATCH result.
    /// </summary>
    /// <param name="audioStream">Raw PCM audio bytes (headerless or WAV; must match the declared format).</param>
    /// <returns>The recognized text, or null when nothing was recognized or an error occurred.</returns>
    public static async Task<string> RecognizeSpeechFromStreamAsync(Stream audioStream)
    {
        try
        {
            byte channels = 1;
            byte bitsPerSample = 16;
            uint samplesPerSecond = 16000; // or 8000 — must match the actual audio's sample rate
            var audioFormat = AudioStreamFormat.GetWaveFormatPCM(samplesPerSecond, bitsPerSample, channels);

            var speechConfig = SpeechConfig.FromSubscription(speechKey, speechRegion);
            speechConfig.SpeechRecognitionLanguage = "en-US";

            // Both the pull-stream callback and the AudioConfig hold native SDK
            // resources; dispose them deterministically instead of leaking them.
            using (var contosoStream = new ContosoAudioStream(audioStream))
            using (var audioConfig = AudioConfig.FromStreamInput(contosoStream, audioFormat))
            using (var speechRecognizer = new SpeechRecognizer(speechConfig, audioConfig))
            {
                Console.WriteLine("Starting speech recognition from stream...");
                var speechRecognitionResult = await speechRecognizer.RecognizeOnceAsync();

                if (speechRecognitionResult.Reason == ResultReason.RecognizedSpeech)
                {
                    Console.WriteLine($"RECOGNIZED: Text={speechRecognitionResult.Text}");
                    return speechRecognitionResult.Text;
                }
                else if (speechRecognitionResult.Reason == ResultReason.NoMatch)
                {
                    // Audio was received but no speech was matched (often a format mismatch).
                    Console.WriteLine($"NOMATCH: Speech could not be recognized.");
                    return null; // Or an appropriate error message
                }
                else if (speechRecognitionResult.Reason == ResultReason.Canceled)
                {
                    var cancellation = CancellationDetails.FromResult(speechRecognitionResult);
                    Console.WriteLine($"CANCELED: Reason={cancellation.Reason}");

                    if (cancellation.Reason == CancellationReason.Error)
                    {
                        Console.WriteLine($"CANCELED: ErrorCode={cancellation.ErrorCode}");
                        Console.WriteLine($"CANCELED: ErrorDetails={cancellation.ErrorDetails}");
                        Console.WriteLine($"CANCELED: Did you set the speech resource key and region values?");
                        // Consider throwing an exception here to propagate the error
                    }
                    return null; // Or an appropriate error message
                }
                else
                {
                    Console.WriteLine($"Unexpected result reason: {speechRecognitionResult.Reason}");
                    return null; // Or an appropriate error message
                }
            }
        }
        catch (Exception ex)
        {
            Console.Error.WriteLine($"Exception during speech recognition: {ex.Message}");
            return null; // Or throw the exception, depending on your error handling strategy
        }
    }
}

/// <summary>
/// Pull-mode audio callback that feeds the Azure Speech SDK from an arbitrary
/// <see cref="Stream"/>, handing over at most a fixed number of bytes per read.
/// </summary>
public class ContosoAudioStream : PullAudioInputStreamCallback
{
    private readonly BinaryReader _audioReader;  // wraps the caller-supplied stream
    private readonly int _maxChunkBytes;         // upper bound on bytes returned per Read call

    public ContosoAudioStream(Stream audioStream, int chunkSize = 1024)
    {
        _audioReader = new BinaryReader(audioStream);
        _maxChunkBytes = chunkSize;
    }

    /// <summary>
    /// Copies up to min(size, chunk size) bytes into <paramref name="buffer"/> and
    /// returns the count delivered. Returning 0 signals end-of-stream to the SDK.
    /// </summary>
    public override int Read(byte[] buffer, uint size)
    {
        try
        {
            var request = (int)Math.Min(size, _maxChunkBytes);
            var chunk = _audioReader.ReadBytes(request);
            chunk.CopyTo(buffer, 0);
            return chunk.Length;
        }
        catch (EndOfStreamException)
        {
            // Out of audio: report 0 so the recognizer stops pulling.
            return 0;
        }
        catch (Exception ex)
        {
            Console.Error.WriteLine($"Error reading from stream: {ex.Message}");
            return 0;
        }
    }

    /// <summary>Closes the reader (and with it the wrapped stream).</summary>
    public override void Close()
    {
        _audioReader?.Close();
        Console.WriteLine("ContosoAudioStream closed.");
    }
}

I'm using Microsoft.CognitiveServices.Speech speech to text service where I'm giving input as MemoryStream instead of file input using a custom api. However I get the error "NOMATCH: Speech could not be recognized". The code works when I'm using a file input where I read the file and give the input as FileStream. Here is the code I'm using:

    /// <summary>
    /// Runs one-shot Azure speech recognition over a raw PCM stream (declared to
    /// the SDK as 16 kHz, 16-bit, mono) and returns the transcript.
    /// </summary>
    /// <param name="audioStream">PCM audio matching the declared wave format.</param>
    /// <returns>The recognized text, or null if recognition did not succeed.</returns>
    public static async Task<string> RecognizeSpeechFromStreamAsync(Stream audioStream)
    {
        try
        {
            // Declared input format: 16000 Hz (or 8000), 16-bit samples, 1 channel.
            var waveFormat = AudioStreamFormat.GetWaveFormatPCM(16000, 16, 1);

            var pullStream = new ContosoAudioStream(audioStream);
            var audioInput = AudioConfig.FromStreamInput(pullStream, waveFormat);

            var config = SpeechConfig.FromSubscription(speechKey, speechRegion);
            config.SpeechRecognitionLanguage = "en-US";

            using (var recognizer = new SpeechRecognizer(config, audioInput))
            {
                Console.WriteLine("Starting speech recognition from stream...");
                var result = await recognizer.RecognizeOnceAsync();

                switch (result.Reason)
                {
                    case ResultReason.RecognizedSpeech:
                        Console.WriteLine($"RECOGNIZED: Text={result.Text}");
                        return result.Text;

                    case ResultReason.NoMatch:
                        // Audio arrived but no speech was matched.
                        Console.WriteLine($"NOMATCH: Speech could not be recognized.");
                        return null;

                    case ResultReason.Canceled:
                        var details = CancellationDetails.FromResult(result);
                        Console.WriteLine($"CANCELED: Reason={details.Reason}");
                        if (details.Reason == CancellationReason.Error)
                        {
                            Console.WriteLine($"CANCELED: ErrorCode={details.ErrorCode}");
                            Console.WriteLine($"CANCELED: ErrorDetails={details.ErrorDetails}");
                            Console.WriteLine($"CANCELED: Did you set the speech resource key and region values?");
                        }
                        return null;

                    default:
                        Console.WriteLine($"Unexpected result reason: {result.Reason}");
                        return null;
                }
            }
        }
        catch (Exception ex)
        {
            Console.Error.WriteLine($"Exception during speech recognition: {ex.Message}");
            return null;
        }
    }
}

/// <summary>
/// Adapts a .NET <see cref="Stream"/> to the Speech SDK's pull-audio callback
/// model; each Read delivers at most one configured chunk of bytes.
/// </summary>
public class ContosoAudioStream : PullAudioInputStreamCallback
{
    private BinaryReader _reader;
    private int _chunkSize;

    public ContosoAudioStream(Stream audioStream, int chunkSize = 1024)
        => (_reader, _chunkSize) = (new BinaryReader(audioStream), chunkSize);

    /// <summary>
    /// Fills <paramref name="buffer"/> with up to min(size, chunk size) bytes.
    /// A return of 0 tells the SDK the audio stream is exhausted.
    /// </summary>
    public override int Read(byte[] buffer, uint size)
    {
        try
        {
            // Cap the request at the configured chunk size.
            int limit = _chunkSize < size ? _chunkSize : (int)size;
            byte[] data = _reader.ReadBytes(limit);
            data.CopyTo(buffer, 0);
            return data.Length;
        }
        catch (EndOfStreamException)
        {
            return 0; // end of audio
        }
        catch (Exception ex)
        {
            Console.Error.WriteLine($"Error reading from stream: {ex.Message}");
            return 0;
        }
    }

    /// <summary>Releases the reader and the stream it wraps.</summary>
    public override void Close()
    {
        _reader?.Close();
        Console.WriteLine("ContosoAudioStream closed.");
    }
}
Asked Mar 25 at 9:40 by Maryam Mirza (5 reputation, 2 bronze badges); edited Mar 26 at 9:59 by VLAZ (29.1k reputation, 9 gold, 63 silver, 84 bronze badges). 0 comments.
Add a comment  | 

1 Answer 1

Reset to default 0

error "NOMATCH: Speech could not be recognized"

I got the same error when I tried with a WAV file with a sample rate of 48,000 Hz.

Use the command below to check the sample rate of your WAV file.

ffmpeg -i <path/to/.wav file>

So, to resolve the issue, I converted my WAV file to 16,000 Hz using the command below and successfully got the speech to text output.

ffmpeg -i "<path/to/.wav file>" -ar 16000 -ac 1 -sample_fmt s16 "<path/to/converted.wav file>"

Code :

using Microsoft.CognitiveServices.Speech;
using Microsoft.CognitiveServices.Speech.Audio;

/// <summary>
/// Demo program: loads a WAV file into a MemoryStream and transcribes it with
/// the Azure Speech SDK. The audio must be PCM 16 kHz, 16-bit, mono to match
/// the format declared to the SDK; a mismatched sample rate yields NOMATCH.
/// </summary>
class Program
{
    private static string speechKey = "<SpeechKey>";
    // FIX: this placeholder previously read "<SpeechKey>" — it is the service
    // REGION of the Speech resource (e.g. "eastus"), not the key.
    private static string speechRegion = "<SpeechRegion>";

    static async Task Main(string[] args)
    {
        string filePath = "<path/to/.wav file>";
        try
        {
            if (!File.Exists(filePath))
            {
                Console.WriteLine("Error: Audio file not found.");
                return;
            }
            byte[] audioData = File.ReadAllBytes(filePath);
            using (var memoryStream = new MemoryStream(audioData))
            {
                string resultText = await RecognizeSpeechFromStreamAsync(memoryStream);
                Console.WriteLine($"Recognition Result: {resultText}");
            }
        }
        catch (Exception ex)
        {
            Console.WriteLine($"Exception: {ex.Message}");
        }
    }

    /// <summary>
    /// Performs one-shot speech recognition over a raw PCM audio stream
    /// (declared as 16 kHz, 16-bit, mono) and returns the recognized text.
    /// </summary>
    /// <param name="audioStream">PCM audio matching the declared format.</param>
    /// <returns>The transcript, or null when recognition fails or errors.</returns>
    public static async Task<string> RecognizeSpeechFromStreamAsync(Stream audioStream)
    {
        try
        {
            byte channels = 1;
            byte bitsPerSample = 16;
            uint samplesPerSecond = 16000;
            var audioFormat = AudioStreamFormat.GetWaveFormatPCM(samplesPerSecond, bitsPerSample, channels);
            var speechConfig = SpeechConfig.FromSubscription(speechKey, speechRegion);
            speechConfig.SpeechRecognitionLanguage = "en-US";

            // Dispose the pull-stream callback and audio config so native SDK
            // resources are released deterministically.
            using (var contosoStream = new ContosoAudioStream(audioStream))
            using (var audioConfig = AudioConfig.FromStreamInput(contosoStream, audioFormat))
            using (var speechRecognizer = new SpeechRecognizer(speechConfig, audioConfig))
            {
                Console.WriteLine("Starting speech recognition from stream...");
                var speechRecognitionResult = await speechRecognizer.RecognizeOnceAsync();
                if (speechRecognitionResult.Reason == ResultReason.RecognizedSpeech)
                {
                    Console.WriteLine($"RECOGNIZED: Text={speechRecognitionResult.Text}");
                    return speechRecognitionResult.Text;
                }
                else if (speechRecognitionResult.Reason == ResultReason.NoMatch)
                {
                    // Audio was received but no speech matched (often a format mismatch).
                    Console.WriteLine($"NOMATCH: Speech could not be recognized.");
                    return null;
                }
                else if (speechRecognitionResult.Reason == ResultReason.Canceled)
                {
                    var cancellation = CancellationDetails.FromResult(speechRecognitionResult);
                    Console.WriteLine($"CANCELED: Reason={cancellation.Reason}");

                    if (cancellation.Reason == CancellationReason.Error)
                    {
                        Console.WriteLine($"CANCELED: ErrorCode={cancellation.ErrorCode}");
                        Console.WriteLine($"CANCELED: ErrorDetails={cancellation.ErrorDetails}");
                        Console.WriteLine($"CANCELED: Did you set the speech resource key and region values?");
                    }
                    return null;
                }
                else
                {
                    Console.WriteLine($"Unexpected result reason: {speechRecognitionResult.Reason}");
                    return null;
                }
            }
        }
        catch (Exception ex)
        {
            Console.Error.WriteLine($"Exception during speech recognition: {ex.Message}");
            return null;
        }
    }
}

/// <summary>
/// Pull-audio callback bridging a <see cref="Stream"/> to the Speech SDK;
/// serves the recognizer bounded chunks of bytes on demand.
/// </summary>
public class ContosoAudioStream : PullAudioInputStreamCallback
{
    // Reader over the caller's audio stream.
    private BinaryReader _reader;
    // Maximum bytes handed to the SDK in a single Read call.
    private int _chunkSize;

    public ContosoAudioStream(Stream audioStream, int chunkSize = 1024)
    {
        _chunkSize = chunkSize;
        _reader = new BinaryReader(audioStream);
    }

    /// <summary>
    /// Delivers up to min(size, chunk size) bytes into <paramref name="buffer"/>;
    /// returning 0 marks end-of-stream for the recognizer.
    /// </summary>
    public override int Read(byte[] buffer, uint size)
    {
        try
        {
            long requested = Math.Min(size, _chunkSize);
            byte[] payload = _reader.ReadBytes((int)requested);
            payload.CopyTo(buffer, 0);
            return payload.Length;
        }
        catch (EndOfStreamException)
        {
            return 0;
        }
        catch (Exception ex)
        {
            Console.Error.WriteLine($"Error reading from stream: {ex.Message}");
            return 0;
        }
    }

    /// <summary>Closes the reader and its underlying stream.</summary>
    public override void Close()
    {
        _reader?.Close();
        Console.WriteLine("ContosoAudioStream closed.");
    }
}

Output :

发布者:admin,转转请注明出处:http://www.yc00.com/questions/1744205963a4563103.html

相关推荐

发表回复

评论列表(0条)

  • 暂无评论

联系我们

400-800-8888

在线咨询: QQ交谈

邮件:admin@example.com

工作时间:周一至周五,9:30-18:30,节假日休息

关注微信