openai whisper - Unable to perform alignment for word-level timestamps in WhisperX run locally on NVIDIA GeForce RTX 3050 - Stack

I want to generate word-level highlighting for my videos using WhisperX, as shown here, where the subtitle shows up and each word gets highlighted as it is spoken.

I have installed the dependencies as described on the GitHub page. However, when I run the following command in Command Prompt on Windows:

C:\Users\profe>whisperx "C:mypathtofile\test audio 2.mp3" --model medium --align_model WAV2VEC2_ASR_LARGE_LV60K_960H --highlight_words True --verbose True

the Command Prompt stops at "Performing alignment..." and never gets past it, as shown here.

The files that are generated as output are:

The JSON file does contain the word-level timestamps. Things I tried:

  1. changed the alignment model
  2. changed the model
  3. added --device CUDA & --verbose True

None of these let it proceed beyond the "Performing alignment..." step. Any leads on this would be greatly appreciated. Thank you.
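Since the JSON output already has the word-level timestamps, one fallback I am considering is building the highlighted subtitles from it myself. Below is a minimal sketch of that idea, not WhisperX's own writer. It assumes the JSON has a top-level "segments" list whose entries carry a "words" list with "word", "start" and "end" keys. The function names and the choice of `<u>` tags for the highlight are my own, and the sample data is made up for illustration.

```python
def fmt_time(seconds):
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    ms = int(round(seconds * 1000))
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


def highlighted_srt(result):
    """Build one SRT cue per timed word, underlining the word being spoken.

    `result` is the parsed JSON; the key names used here are assumptions
    about the file layout, not a documented contract.
    """
    cues = []
    for seg in result.get("segments", []):
        words = seg.get("words", [])
        for i, w in enumerate(words):
            # Assumption: an entry may lack timestamps; skip it if so.
            if "start" not in w or "end" not in w:
                continue
            text = " ".join(
                f"<u>{x['word']}</u>" if j == i else x["word"]
                for j, x in enumerate(words)
            )
            cues.append(
                f"{len(cues) + 1}\n"
                f"{fmt_time(w['start'])} --> {fmt_time(w['end'])}\n"
                f"{text}\n"
            )
    return "\n".join(cues)


# Hypothetical sample mirroring the assumed JSON layout.
sample = {
    "segments": [
        {
            "start": 0.0,
            "end": 1.0,
            "text": "hello world",
            "words": [
                {"word": "hello", "start": 0.0, "end": 0.5},
                {"word": "world", "start": 0.5, "end": 1.0},
            ],
        }
    ]
}

srt_text = highlighted_srt(sample)
print(srt_text)
```

With a real file, `result` would come from `json.load()` on the generated JSON, and `srt_text` would be written to an .srt file next to the video.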

