Home

zenei hal gyujts tuzet automatic speech recognition dataset generation github movie subtitle Transzformátor Könyörtelen téma

voice-activity-detection · GitHub Topics · GitHub
voice-activity-detection · GitHub Topics · GitHub

What was that?” Increasing subtitle accuracy for live broadcasts using  Amazon Transcribe | AWS for M&E Blog
What was that?” Increasing subtitle accuracy for live broadcasts using Amazon Transcribe | AWS for M&E Blog

Sensors | Free Full-Text | Reliability-Based Large-Vocabulary Audio-Visual Speech  Recognition
Sensors | Free Full-Text | Reliability-Based Large-Vocabulary Audio-Visual Speech Recognition

GitHub - khuangaf/ITRI-speech-recognition-dataset-generation: Automatic  Speech Recognition Dataset Generation
GitHub - khuangaf/ITRI-speech-recognition-dataset-generation: Automatic Speech Recognition Dataset Generation

GitHub - PaddlePaddle/PaddleSpeech: Easy-to-use Speech Toolkit including  Self-Supervised Learning model, SOTA/Streaming ASR with punctuation,  Streaming TTS with text frontend, Speaker Verification System, End-to-End  Speech Translation and Keyword ...
GitHub - PaddlePaddle/PaddleSpeech: Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword ...

Electronics | Free Full-Text | Adversarial Attack and Defense Strategies of  Speaker Recognition Systems: A Survey
Electronics | Free Full-Text | Adversarial Attack and Defense Strategies of Speaker Recognition Systems: A Survey

arXiv:1903.00216v1 [cs.CL] 1 Mar 2019
arXiv:1903.00216v1 [cs.CL] 1 Mar 2019

voice-activity-detection · GitHub Topics · GitHub
voice-activity-detection · GitHub Topics · GitHub

OpenAI Whisper — Your speech-to-text AI: History and usage | SuperAnnotate
OpenAI Whisper — Your speech-to-text AI: History and usage | SuperAnnotate

subtitles · GitHub Topics · GitHub
subtitles · GitHub Topics · GitHub

Speech Emotion Recognition Project using Machine Learning
Speech Emotion Recognition Project using Machine Learning

A list of audio datasets for Speech Recognition and other audio related  tasks (both free and not free) : r/datasets
A list of audio datasets for Speech Recognition and other audio related tasks (both free and not free) : r/datasets

PDF) CEASR: A Corpus for Evaluating Automatic Speech Recognition
PDF) CEASR: A Corpus for Evaluating Automatic Speech Recognition

Best Speech Recognition Software 2022
Best Speech Recognition Software 2022

Add Subtitles to Video using AI for free using this Open Source Tool
Add Subtitles to Video using AI for free using this Open Source Tool

Generating automatic video subtitles from any language with Whisper  AutoCaption
Generating automatic video subtitles from any language with Whisper AutoCaption

Information | Free Full-Text | Reconsidering Read and Spontaneous Speech:  Causal Perspectives on the Generation of Training Data for Automatic Speech  Recognition
Information | Free Full-Text | Reconsidering Read and Spontaneous Speech: Causal Perspectives on the Generation of Training Data for Automatic Speech Recognition

Develop Smaller Speech Recognition Models with the NVIDIA NeMo Framework |  NVIDIA Technical Blog
Develop Smaller Speech Recognition Models with the NVIDIA NeMo Framework | NVIDIA Technical Blog

How to Build Domain Specific Automatic Speech Recognition Models on GPUs |  NVIDIA Technical Blog
How to Build Domain Specific Automatic Speech Recognition Models on GPUs | NVIDIA Technical Blog

Facebook's Wav2Vec using Hugging Face's transformer for Speech Recognition  - YouTube
Facebook's Wav2Vec using Hugging Face's transformer for Speech Recognition - YouTube

Thinking out loud, an open-access EEG-based BCI dataset for inner speech  recognition | Scientific Data
Thinking out loud, an open-access EEG-based BCI dataset for inner speech recognition | Scientific Data

GitHub - espnet/espnet: End-to-End Speech Processing Toolkit
GitHub - espnet/espnet: End-to-End Speech Processing Toolkit

GitHub - zats/SpeechRecognition: Generating subtitles for a video in  realtime using SFSpeechRecognizer
GitHub - zats/SpeechRecognition: Generating subtitles for a video in realtime using SFSpeechRecognizer

MovieNet: A Holistic Dataset for Movie Understanding – arXiv Vanity
MovieNet: A Holistic Dataset for Movie Understanding – arXiv Vanity

PDF) Pansori: ASR Corpus Generation from Open Online Video Contents
PDF) Pansori: ASR Corpus Generation from Open Online Video Contents

The State of Multilingual AI
The State of Multilingual AI

Benchmarking Top Open Source Speech Recognition Models: Whisper, Facebook  wav2vec2, and Kaldi - Deepgram Blog ⚡️
Benchmarking Top Open Source Speech Recognition Models: Whisper, Facebook wav2vec2, and Kaldi - Deepgram Blog ⚡️