
MagpieTTS refactor #15504

Open
paarthneekhara wants to merge 3 commits intoNVIDIA-NeMo:mainfrom
paarthneekhara:magpietts_refactor_pr

Conversation

@paarthneekhara
Collaborator

This change is mainly motivated by EasyMagpie, which will reuse some functionality shared with Magpie. To avoid code duplication, we are moving the common pieces together.

After this, I will raise a separate PR for the EasyMagpie changes.

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>
@github-actions github-actions bot added the TTS label Mar 16, 2026
If the model has a baked context embedding, the context_encoder weights are also excluded
since they are no longer needed for inference.
"""
def state_dict(self, destination=None, prefix='', keep_vars=False):
Collaborator

Can you add back the docstring?

Collaborator Author

Added the docstrings.


_speaker_verification_model is only included in older checkpoints with the older single_encoder_sv_tts
model_type that is no longer supported and can likely be removed in a future version.
def _get_state_dict_keys_to_exclude(self):
Collaborator

Can you add a docstring to this?

Collaborator Author

Added the docstrings.

Comment on lines +311 to +320
def remove_bos_token(codes, codes_len, num_tokens=1):
codes = codes[:, :, num_tokens:]
codes_len = codes_len - num_tokens
return codes, codes_len


def remove_embedded_bos_token(embedded, embedded_len):
embedded = embedded[:, 1:, :]
embedded_len = embedded_len - 1
return embedded, embedded_len
Collaborator

These two functions look identical. Do we need both?

Collaborator Author

Yeah, both of them were there earlier and are being used. One removes the token from the code tensor (before embedding) and the other from the embedded tensor. Their implementations also differ, since the time axis sits in a different position in each tensor.
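To make the distinction concrete, here is a minimal sketch of the two helpers using numpy stand-ins (the real code operates on torch tensors; the shapes `(B, C, T)` for codec tokens and `(B, T, D)` for embeddings are my reading of the diff, not confirmed by the PR):

```python
import numpy as np

def remove_bos_token(codes, codes_len, num_tokens=1):
    # codes: (B, C, T) integer codec tokens; time is the LAST axis,
    # so the BOS frame is dropped with codes[:, :, num_tokens:]
    return codes[:, :, num_tokens:], codes_len - num_tokens

def remove_embedded_bos_token(embedded, embedded_len):
    # embedded: (B, T, D) float embeddings; time is the MIDDLE axis,
    # so the BOS step is dropped with embedded[:, 1:, :]
    return embedded[:, 1:, :], embedded_len - 1

codes = np.zeros((2, 8, 10), dtype=np.int64)     # batch=2, codebooks=8, T=10
codes_len = np.array([10, 7])
codes, codes_len = remove_bos_token(codes, codes_len)
print(codes.shape)   # (2, 8, 9)

emb = np.zeros((2, 10, 256), dtype=np.float32)   # batch=2, T=10, dim=256
emb_len = np.array([10, 7])
emb, emb_len = remove_embedded_bos_token(emb, emb_len)
print(emb.shape)     # (2, 9, 256)
```

The slicing is identical in spirit, but applied to different axes, which is why folding the two into one generic helper would cost more in readability than it saves.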

Comment on lines +323 to +341
def remove_eos_token(codes, codes_len):
codes_len = codes_len - 1
codes = codes[:, :, :-1]
mask = get_mask_from_lengths(lengths=codes_len)
codes = codes * mask.unsqueeze(1)
return codes, codes_len


def remove_embedded_eos_token(embedded, embedded_len):
"""Remove the last token from embedded sequences.

Args:
embedded: (B, T', D)
"""
embedded_len = embedded_len - 1
embedded = embedded[:, :-1, :]
mask = get_mask_from_lengths(lengths=embedded_len)
embedded = embedded * mask.unsqueeze(2)
return embedded, embedded_len
Collaborator

These two functions look identical. Do we need both?

Collaborator Author

Same as the above.
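The EOS pair differs from the BOS pair in one detail worth noting: after truncating the last position, sequences shorter than the batch maximum have stale values in their padding region, so the result is re-masked. A minimal numpy sketch (`get_mask_from_lengths` here is a stand-in for the NeMo helper, and the `(B, C, T)` shape is my assumption from the diff):

```python
import numpy as np

def get_mask_from_lengths(lengths, max_len):
    # Stand-in for the NeMo helper: (B, T) boolean mask,
    # True where position < sequence length.
    return np.arange(max_len)[None, :] < lengths[:, None]

def remove_eos_token(codes, codes_len):
    # codes: (B, C, T). Drop the last frame, then zero out padding,
    # since shorter sequences keep stale values past their new length.
    codes_len = codes_len - 1
    codes = codes[:, :, :-1]
    mask = get_mask_from_lengths(codes_len, codes.shape[-1])
    return codes * mask[:, None, :], codes_len

codes = np.ones((2, 4, 6), dtype=np.int64)
codes_len = np.array([6, 4])
codes, codes_len = remove_eos_token(codes, codes_len)
print(codes[1, 0])  # [1 1 1 0 0] -- positions past the new length are zeroed
```

Without the mask step, the second sequence would keep nonzero values at positions 3 and 4 even though its valid length is now 3.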

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>