Pulse · NVIDIA/NeMo

July 8, 2025 – July 15, 2025

55 Unresolved conversations

Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.

Add support of Llama4 VLM PTQ
#13567 commented on Jul 15, 2025 • 16 new comments
autoConfigurator performance calculation and config name update
#14071 commented on Jul 15, 2025 • 13 new comments
Canary2 with NFA
#14121 commented on Jul 9, 2025 • 2 new comments
Load master weights from checkpoint
#14072 commented on Jul 14, 2025 • 2 new comments
S2S data to audio-input/text-output multimodal conversation converter
#14124 commented on Jul 15, 2025 • 1 new comment
Enable packed sequence in SFT with Chat dataset
#14116 commented on Jul 12, 2025 • 1 new comment
Add calculate_per_token_loss support for Context Parallel
#14065 commented on Jul 16, 2025 • 0 new comments
chore(🤖): Bump `NVIDIA/Megatron-LM` to `6b70889...` (2025-06-29)
#14062 commented on Jul 13, 2025 • 0 new comments
chore(🤖): Bump `NVIDIA/Megatron-LM` to `03f7793...` (2025-06-29)
#14061 commented on Jul 13, 2025 • 0 new comments
chore(🤖): Bump `NVIDIA/Megatron-LM` to `03f7793...` (2025-06-28)
#14054 commented on Jul 12, 2025 • 0 new comments
chore(🤖): Bump `NVIDIA/Megatron-LM` to `6b70889...` (2025-06-28)
#14053 commented on Jul 12, 2025 • 0 new comments
Streaming Sortformer Tutorial, postprocessing yaml files and tests
#14052 commented on Jul 16, 2025 • 0 new comments
Add timestamps option to streaming inference script
#14030 commented on Jul 11, 2025 • 0 new comments
Aaftabv row wise der
#14024 commented on Jul 10, 2025 • 0 new comments
Gaod/dpskv3/perf analysis
#14015 commented on Jul 10, 2025 • 0 new comments
Toggle skip
#13958 commented on Jul 15, 2025 • 0 new comments
Remove unnecessary padding
#13957 commented on Jul 16, 2025 • 0 new comments
chore(🤖): Bump `NVIDIA/Megatron-LM` to `03f7793...` (2025-06-30)
#14066 commented on Jul 14, 2025 • 0 new comments
chore(🤖): Bump `NVIDIA/Megatron-LM` to `6b70889...` (2025-06-30)
#14067 commented on Jul 14, 2025 • 0 new comments
Change to enable full iteration CUDA graph for LLMs
#14077 commented on Jul 12, 2025 • 0 new comments
chore(🤖): Bump `NVIDIA/Megatron-LM` to `1d4a1f7...` (2025-07-01)
#14078 commented on Jul 15, 2025 • 0 new comments
chore(🤖): Bump `NVIDIA/Megatron-LM` to `8a416d0...` (2025-07-01)
#14079 commented on Jul 15, 2025 • 0 new comments
Small fix for clustering diarizer
#14093 commented on Jul 15, 2025 • 0 new comments
Remove nemo1 multimodal and vision
#14095 commented on Jul 15, 2025 • 0 new comments
Nfa lhotse tar
#14096 commented on Jul 15, 2025 • 0 new comments
Add interface to use high priority stream for communicator groups
#14151 commented on Jul 9, 2025 • 0 new comments
Fix model training/eval state after PTL validation loop
#14152 commented on Jul 11, 2025 • 0 new comments
chore(🤖): Bump `NVIDIA/Megatron-LM` to `4560888...` (2025-07-09)
#14164 commented on Jul 9, 2025 • 0 new comments
[Request] Benchmark tok/sec for Deepseek V3 pretrain recipe
#14158 commented on Jul 9, 2025 • 0 new comments
Loading .nemo model throws unhandled exception
#13858 commented on Jul 11, 2025 • 0 new comments
Empty output with speech_to_text_buffered_infer_rnnt.py
#13797 commented on Jul 11, 2025 • 0 new comments
export_ckpt failed due to AssertionError: dtype mismatch between source and target state dicts
#13455 commented on Jul 11, 2025 • 0 new comments
Q: Forced alignment with streaming
#13863 commented on Jul 12, 2025 • 0 new comments
DuplexS2SModel
#13902 commented on Jul 13, 2025 • 0 new comments
Adjusting "global_batch_size" and "micro_batch_size" has no impact on how long each training step takes when using HFAutoModel.
#13887 commented on Jul 13, 2025 • 0 new comments
TDT Head Stagnation in parakeet-tdt_ctc-110m Fine-tuning on Persian Data
#14140 commented on Jul 13, 2025 • 0 new comments
hugging face saved model inference
#13891 commented on Jul 14, 2025 • 0 new comments
How to pretrain from scratch the Qwen3 4B model but with MoE?
#13845 commented on Jul 14, 2025 • 0 new comments
How to pretrain from scratch the Qwen 3 4B or 7B dense model (with my own data)?
#13844 commented on Jul 14, 2025 • 0 new comments
eval_beamsearch_ngram_ctc.py won't run
#13064 commented on Jul 16, 2025 • 0 new comments
SDE with GPU acceleration
#8657 commented on Jul 14, 2025 • 0 new comments
Add safetensor option when saving and restoring models
#11549 commented on Jul 13, 2025 • 0 new comments
Feature/wsd scheduler
#12611 commented on Jul 9, 2025 • 0 new comments
Transducer with Transformer-Decoder (GPT-like)
#13030 commented on Jul 10, 2025 • 0 new comments
add new tokenizer system support
#13090 commented on Jul 11, 2025 • 0 new comments
Add Parakeet Hybrid RNNT CTC BPE Model with target language support
#13360 commented on Jul 11, 2025 • 0 new comments
Add CallbackGroup & Metadata factory function
#13437 commented on Jul 14, 2025 • 0 new comments
IPL callback and two scripts
#13671 commented on Jul 10, 2025 • 0 new comments
Bump vllm from 0.8.5.post1 to 0.9.0 in /requirements
#13758 commented on Jul 13, 2025 • 0 new comments
Llama Nemotron VL
#13819 commented on Jul 13, 2025 • 0 new comments
Experimental Magpie Decoder Only Model
#13833 commented on Jul 16, 2025 • 0 new comments
Sketch dist-ckpt content versioning
#13839 commented on Jul 15, 2025 • 0 new comments
feat: print expert groups on megatron init
#13874 commented on Jul 10, 2025 • 0 new comments
ASR models knowledge distillation example
#13939 commented on Jul 16, 2025 • 0 new comments
[Draft] Downsampled main transformer
#13952 commented on Jul 14, 2025 • 0 new comments

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

July 8, 2025 – July 15, 2025

Overview

Could not load contribution data

48 Pull requests merged by 20 people

27 Pull requests opened by 15 people

11 Issues closed by 4 people

10 Issues opened by 9 people

55 Unresolved conversations

Insights: NVIDIA/NeMo

July 8, 2025 – July 15, 2025

Overview

Could not load contribution data

48 Pull requests merged by 20 people

27 Pull requests opened by 15 people

11 Issues closed by 4 people

10 Issues opened by 9 people

55 Unresolved conversations