Conversation
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
| For technical documentation, please see the | ||
| [NeMo Framework User Guide](https://docs.nvidia.com/nemo-framework/user-guide/latest/playbooks/index.html). |
There was a problem hiding this comment.
@pzelasko Is this the correct link to the latest documentation?
There was a problem hiding this comment.
I think this is the most recent for nightly docs, but not sure about the latest release.
https://docs.nvidia.com/nemo/speech/nightly/starthere/intro.html
| @@ -1,2 +1,2 @@ | |||
| [](http://www.repostatus.org/#active) | |||
| [](https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/main/) | |||
There was a problem hiding this comment.
@pzelasko Do we still use readthedocs or should I update this badge or remove it?
pzelasko
left a comment
There was a problem hiding this comment.
Looks great, left a few comments.
| for 9 languages(En, Es, De, Fr, Vi, It, Zh, Hi, Ja). Try out [the demo](https://huggingface.co/nvidia/magpie_tts_multilingual_357m). | ||
| - 2026-01: [Nemotron-Speech-Streaming](https://huggingface.co/nvidia/nemotron-speech-streaming-en-0.6b) has been | ||
| released: One checkpoint that enables users to pick their optimal point on the latency-accuracy Pareto curve! Try | ||
| out [the demo](https://huggingface.co/spaces/nvidia/nemotron-speech-streaming-en-0.6b). |
There was a problem hiding this comment.
Let's add:
2025-08: [Parakeet V3](https://huggingface.co/nvidia/parakeet-tdt-0.6b-v3) and [Canary V2](https://huggingface.co/nvidia/canary-1b-v2) have been released with speech recognition and translation support for 25 European languages.
2025-06: [Canary-Qwen-2.5B](https://huggingface.co/nvidia/canary-qwen-2.5b) has been released with record-setting 5.63% WER on English Open ASR Leaderboard.
|
|
||
| - **Scalability** - NeMo 2.0 seamlessly scaling large-scale experiments across thousands of GPUs using [NeMo-Run](https://github.com/NVIDIA/NeMo-Run), a powerful tool designed to streamline the configuration, execution, and management of machine learning experiments across computing environments. | ||
| NVIDIA NeMo Speech is built for researchers and PyTorch developers working on Speech models including Automatic Speech | ||
| Recognition (ASR) and Text to Speech (TTS). It is designed to help you efficiently create, customize, and deploy new |
There was a problem hiding this comment.
| Recognition (ASR) and Text to Speech (TTS). It is designed to help you efficiently create, customize, and deploy new | |
| Recognition (ASR), Text to Speech (TTS), and Speech LLMs. It is designed to help you efficiently create, customize, and deploy new |
| For technical documentation, please see the | ||
| [NeMo Framework User Guide](https://docs.nvidia.com/nemo-framework/user-guide/latest/playbooks/index.html). |
There was a problem hiding this comment.
I think this is the most recent for nightly docs, but not sure about the latest release.
https://docs.nvidia.com/nemo/speech/nightly/starthere/intro.html
What does this PR do ?
Update README
Collection: None
Changelog
PR Type: