forked from NVIDIA/NeMo
-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
updating #7
Merged
Merged
updating #7
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
* fix world size and don't mock Signed-off-by: Maanu Grover <[email protected]> * cleanup global state Signed-off-by: Maanu Grover <[email protected]> * check app state instead Signed-off-by: Maanu Grover <[email protected]> * fix syntax nemo logger test Signed-off-by: Maanu Grover <[email protected]> --------- Signed-off-by: Maanu Grover <[email protected]>
…ress` (#11016) * Revert "[NeMo-UX] Use custom `BatchProgress` class which does not restore states (#10383)" This reverts commit b5798de. * make megatron sampler return the total number of batches in the dataset Signed-off-by: ashors1 <[email protected]> --------- Signed-off-by: ashors1 <[email protected]>
* initial commit Signed-off-by: Piotr Kaminski <[email protected]> * create Quantizer for NeMo 2.0 Signed-off-by: Piotr Kaminski <[email protected]> * refactor Signed-off-by: Piotr Kaminski <[email protected]> * Call quantize on an unwrapped mcore model Signed-off-by: Piotr Kaminski <[email protected]> * Apply isort and black reformatting Signed-off-by: Laplasjan107 <[email protected]> * Add tests, adjust unwrapping Signed-off-by: Piotr Kaminski <[email protected]> * Apply isort and black reformatting Signed-off-by: Laplasjan107 <[email protected]> * fix export Signed-off-by: Piotr Kaminski <[email protected]> * Apply isort and black reformatting Signed-off-by: Laplasjan107 <[email protected]> * Apply isort and black reformatting Signed-off-by: artbataev <[email protected]> * Fix output_path argument for HF import Signed-off-by: Piotr Kamiński <[email protected]> * fix fabric ckpt loading Signed-off-by: Piotr Kaminski <[email protected]> * Apply isort and black reformatting Signed-off-by: Laplasjan107 <[email protected]> * code review suggestions Signed-off-by: Piotr Kaminski <[email protected]> * Apply isort and black reformatting Signed-off-by: Laplasjan107 <[email protected]> * remove unused import Signed-off-by: Piotr Kaminski <[email protected]> * use cnn dataset in github ci Signed-off-by: Piotr Kaminski <[email protected]> * applied code review Signed-off-by: Piotr Kaminski <[email protected]> * code review changes Signed-off-by: Piotr Kaminski <[email protected]> * Apply isort and black reformatting Signed-off-by: Laplasjan107 <[email protected]> * simplify interface for data iterator Signed-off-by: Piotr Kaminski <[email protected]> * Apply isort and black reformatting Signed-off-by: Laplasjan107 <[email protected]> * (partial) PP fix Signed-off-by: Piotr Kaminski <[email protected]> * Apply isort and black reformatting Signed-off-by: Laplasjan107 <[email protected]> --------- Signed-off-by: Piotr Kaminski <[email protected]> Signed-off-by: Laplasjan107 <[email protected]> Signed-off-by: Piotr Kamiński <[email protected]> Signed-off-by: artbataev <[email protected]> Co-authored-by: Piotr Kaminski <[email protected]> Co-authored-by: Laplasjan107 <[email protected]> Co-authored-by: artbataev <[email protected]>
…10875) * add token duration Signed-off-by: monica-sekoyan <[email protected]> * revert rnnt change Signed-off-by: monica-sekoyan <[email protected]> * add remove_extra_whitespaces arg to spe tokenizer Signed-off-by: monica-sekoyan <[email protected]> * add token duration retrieval Signed-off-by: monica-sekoyan <[email protected]> * add ignore_extra_whitespace to spe Signed-off-by: monica-sekoyan <[email protected]> * add compute_timestamp support for tdt Signed-off-by: monica-sekoyan <[email protected]> * fix config field name Signed-off-by: monica-sekoyan <[email protected]> * add refinement for tdt timestamps Signed-off-by: monica-sekoyan <[email protected]> * add segments timestamp support and refinement for ctc Signed-off-by: monica-sekoyan <[email protected]> * modify tests for ctc decoding timestamps Signed-off-by: monica-sekoyan <[email protected]> * add rnnt timestamp tests Signed-off-by: monica-sekoyan <[email protected]> * updated doc Signed-off-by: monica-sekoyan <[email protected]> * fix in test Signed-off-by: monica-sekoyan <[email protected]> * Apply isort and black reformatting Signed-off-by: monica-sekoyan <[email protected]> * fix of unicode char Signed-off-by: monica-sekoyan <[email protected]> * fix rnnt_decoding test Signed-off-by: monica-sekoyan <[email protected]> * workaround for tesst tokenizer Signed-off-by: monica-sekoyan <[email protected]> * Apply isort and black reformatting Signed-off-by: monica-sekoyan <[email protected]> * modify segments formation Signed-off-by: monica-sekoyan <[email protected]> * modify segments for ctc Signed-off-by: monica-sekoyan <[email protected]> * fix in ctc refinement Signed-off-by: monica-sekoyan <[email protected]> * Apply isort and black reformatting Signed-off-by: monica-sekoyan <[email protected]> * minor changes Signed-off-by: monica-sekoyan <[email protected]> * reverse offset change Signed-off-by: monica-sekoyan <[email protected]> * Apply isort and black reformatting Signed-off-by: monica-sekoyan <[email protected]> * warning mode=once Signed-off-by: monica-sekoyan <[email protected]> * Apply isort and black reformatting Signed-off-by: monica-sekoyan <[email protected]> * make ignore_extrawhitespaces false Signed-off-by: monica-sekoyan <[email protected]> * minor changes Signed-off-by: monica-sekoyan <[email protected]> * adjust changes to the tests Signed-off-by: monica-sekoyan <[email protected]> * modify prompt_formatter tests Signed-off-by: monica-sekoyan <[email protected]> * Apply isort and black reformatting Signed-off-by: monica-sekoyan <[email protected]> --------- Signed-off-by: monica-sekoyan <[email protected]> Signed-off-by: monica-sekoyan <[email protected]> Co-authored-by: monica-sekoyan <[email protected]>
* Basic online dynamic quantization with vLLM Signed-off-by: Jan Lasek <[email protected]> * Apply isort and black reformatting Signed-off-by: janekl <[email protected]> * vllm 0.6.3 updates Signed-off-by: Jan Lasek <[email protected]> * Pass quantization param in deploy_vllm_triton.py script Signed-off-by: Jan Lasek <[email protected]> --------- Signed-off-by: Jan Lasek <[email protected]> Signed-off-by: janekl <[email protected]> Co-authored-by: janekl <[email protected]>
* ci: Improve VM maintenance Signed-off-by: Oliver Koenig <[email protected]> * rename stuff Signed-off-by: Oliver Koenig <[email protected]> * title Signed-off-by: Oliver Koenig <[email protected]> * use team Signed-off-by: Oliver Koenig <[email protected]> * run on failure too Signed-off-by: Oliver Koenig <[email protected]> * fix Signed-off-by: Oliver Koenig <[email protected]> * yrdy Signed-off-by: Oliver Koenig <[email protected]> * f Signed-off-by: Oliver Koenig <[email protected]> * test Signed-off-by: Oliver Koenig <[email protected]> * fix Signed-off-by: Oliver Koenig <[email protected]> * f Signed-off-by: Oliver Koenig <[email protected]> * f Signed-off-by: Oliver Koenig <[email protected]> * f Signed-off-by: Oliver Koenig <[email protected]> --------- Signed-off-by: Oliver Koenig <[email protected]>
* add packed seq recipes Signed-off-by: Chen Cui <[email protected]> * Apply isort and black reformatting Signed-off-by: cuichenx <[email protected]> * extraneous prefix Signed-off-by: Chen Cui <[email protected]> * Apply isort and black reformatting Signed-off-by: cuichenx <[email protected]> --------- Signed-off-by: Chen Cui <[email protected]> Signed-off-by: cuichenx <[email protected]> Co-authored-by: cuichenx <[email protected]> Co-authored-by: Pablo Garay <[email protected]>
* adapter inference first commit Signed-off-by: Chen Cui <[email protected]> * Apply isort and black reformatting Signed-off-by: cuichenx <[email protected]> * Fix yaml serialization Signed-off-by: Hemil Desai <[email protected]> * add copyright header Signed-off-by: Chen Cui <[email protected]> * Apply isort and black reformatting Signed-off-by: cuichenx <[email protected]> * revert accidental commit Signed-off-by: Chen Cui <[email protected]> --------- Signed-off-by: Chen Cui <[email protected]> Signed-off-by: cuichenx <[email protected]> Signed-off-by: Hemil Desai <[email protected]> Co-authored-by: cuichenx <[email protected]> Co-authored-by: Hemil Desai <[email protected]> Co-authored-by: Pablo Garay <[email protected]>
* ci: Switch to CPU only runner Signed-off-by: Oliver Koenig <[email protected]> * runner Signed-off-by: Oliver Koenig <[email protected]> * Raise EnvironmentError when CUDA is unavailable to prevent cuda-python failure Signed-off-by: Vladimir Bataev <[email protected]> * Fix error message Signed-off-by: Vladimir Bataev <[email protected]> * update Signed-off-by: Oliver Koenig <[email protected]> * fix Signed-off-by: Oliver Koenig <[email protected]> * finalize Signed-off-by: Oliver Koenig <[email protected]> --------- Signed-off-by: Oliver Koenig <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Co-authored-by: Vladimir Bataev <[email protected]>
Signed-off-by: Rauf <[email protected]>
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: pablo-garay <[email protected]>
* ci: Send team alerts on specific keywords Signed-off-by: Oliver Koenig <[email protected]> * f Signed-off-by: Oliver Koenig <[email protected]> --------- Signed-off-by: Oliver Koenig <[email protected]>
* change timestamps tests Signed-off-by: Monica Sekoyan <[email protected]> * Apply isort and black reformatting Signed-off-by: monica-sekoyan <[email protected]> --------- Signed-off-by: Monica Sekoyan <[email protected]> Signed-off-by: monica-sekoyan <[email protected]> Co-authored-by: monica-sekoyan <[email protected]>
* Add qwen recipe * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> * change to TP1 for small models Signed-off-by: Ao Tang <[email protected]> --------- Signed-off-by: suiyoubi <[email protected]> Signed-off-by: Ao Tang <[email protected]> Co-authored-by: suiyoubi <[email protected]>
* Add copyright notice * Apply isort and black reformatting Signed-off-by: pablo-garay <[email protected]> --------- Signed-off-by: pablo-garay <[email protected]> Co-authored-by: pablo-garay <[email protected]>
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: pablo-garay <[email protected]>
* add deprecation note Signed-off-by: dimapihtar <[email protected]> * Apply isort and black reformatting Signed-off-by: dimapihtar <[email protected]> --------- Signed-off-by: dimapihtar <[email protected]> Signed-off-by: dimapihtar <[email protected]> Co-authored-by: dimapihtar <[email protected]>
…r models (#10924) * initial commit * adding example t5_generate.py * workable inference code * updating code * update cpde * workable solution for T5 tokenizer (we add 100 sentinel tokens when initializing tokenizer throug setting config, instead of adding after initialization) * separate autokenizer's changes to another PR * cleaning code * addressing Marc's comments * addressing Marc's reviews * update code after merge * small fix * Apply isort and black reformatting Signed-off-by: huvunvidia <[email protected]> --------- Signed-off-by: huvunvidia <[email protected]> Co-authored-by: Huy Vu2 <[email protected]> Co-authored-by: root <[email protected]> Co-authored-by: huvunvidia <[email protected]>
… in energon MultiModalSampleConfig (#11041) Signed-off-by: Guyue Huang <[email protected]> Co-authored-by: Guyue Huang <[email protected]>
* update width pruning example defaults Signed-off-by: Keval Morabia <[email protected]> * Update Dockerfile.ci Signed-off-by: Keval Morabia <[email protected]> * Undo CI version update Signed-off-by: Keval Morabia <[email protected]> --------- Signed-off-by: Keval Morabia <[email protected]>
* Add recipe for sc1/sc2 * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> * typo fixed --------- Signed-off-by: suiyoubi <[email protected]> Co-authored-by: suiyoubi <[email protected]>
* Add copyright notice * Add copyright notice
* wrap batch_sampler Signed-off-by: Farhad Ramezanghorbani <[email protected]> * Apply isort and black reformatting Signed-off-by: farhadrgh <[email protected]> * pass dataloader mode * Apply isort and black reformatting Signed-off-by: farhadrgh <[email protected]> * pass dataloader mode Signed-off-by: Farhad Ramezanghorbani <[email protected]> * pass dataloader mode Signed-off-by: Farhad Ramezanghorbani <[email protected]> * resolve conflict Signed-off-by: Farhad Ramezanghorbani <[email protected]> * change import Signed-off-by: Farhad Ramezanghorbani <[email protected]> --------- Signed-off-by: Farhad Ramezanghorbani <[email protected]> Signed-off-by: farhadrgh <[email protected]> Signed-off-by: Chen Cui <[email protected]> Co-authored-by: Chen Cui <[email protected]>
* llama3 finetuning perf recipes progress capture Signed-off-by: Valerie Sarge <[email protected]> * Small syntax fix Signed-off-by: Valerie Sarge <[email protected]> * syntax Signed-off-by: Valerie Sarge <[email protected]> * Apply isort and black reformatting Signed-off-by: vysarge <[email protected]> * Correct ddp setting Signed-off-by: Valerie Sarge <[email protected]> * Fix hasattr check Signed-off-by: Valerie Sarge <[email protected]> * bf16 grad Signed-off-by: Valerie Sarge <[email protected]> * Update configs for 8b + 70b Signed-off-by: Valerie Sarge <[email protected]> * Set wgrad_deferral_limit Signed-off-by: Valerie Sarge <[email protected]> --------- Signed-off-by: Valerie Sarge <[email protected]> Signed-off-by: vysarge <[email protected]> Co-authored-by: vysarge <[email protected]>
* Set TE spec name for NeMo to HF checkpoint converters Signed-off-by: Keval Morabia <[email protected]> * Apply isort and black reformatting Signed-off-by: kevalmorabia97 <[email protected]> * Update convert_falcon_nemo_to_hf.py Signed-off-by: Keval Morabia <[email protected]> --------- Signed-off-by: Keval Morabia <[email protected]> Signed-off-by: kevalmorabia97 <[email protected]> Co-authored-by: kevalmorabia97 <[email protected]>
Signed-off-by: Oliver Koenig <[email protected]>
* switch to NeMo 2.0 recipes Signed-off-by: dimapihtar <[email protected]> * Apply isort and black reformatting Signed-off-by: dimapihtar <[email protected]> * add NeMo 2.0 recipe support Signed-off-by: dimapihtar <[email protected]> * Apply isort and black reformatting Signed-off-by: dimapihtar <[email protected]> * fix unit tests Signed-off-by: dimapihtar <[email protected]> * Apply isort and black reformatting Signed-off-by: dimapihtar <[email protected]> * fix auto conf ci test Signed-off-by: dimapihtar <[email protected]> * remove unused imports Signed-off-by: dimapihtar <[email protected]> * Apply isort and black reformatting Signed-off-by: dimapihtar <[email protected]> * fix stlye Signed-off-by: dimapihtar <[email protected]> * fix typo Signed-off-by: dimapihtar <[email protected]> --------- Signed-off-by: dimapihtar <[email protected]> Signed-off-by: dimapihtar <[email protected]> Co-authored-by: dimapihtar <[email protected]>
* initial commit * adding all recipes and tests * add tests for 220m, 3b, 11b; runnable with example_nemorun_t5.py * restore test_llama3_70b.py * remove unused libraties * remove unused libraries * address Ali's comments * address Ali's comments * edit test_t5_3b.py * Apply isort and black reformatting Signed-off-by: huvunvidia <[email protected]> --------- Signed-off-by: huvunvidia <[email protected]> Co-authored-by: Huy Vu2 <[email protected]> Co-authored-by: huvunvidia <[email protected]>
* initial commit Signed-off-by: Piotr Kaminski <[email protected]> * cleanup Signed-off-by: Piotr Kaminski <[email protected]> * fix fabric num nodes Signed-off-by: Piotr Kaminski <[email protected]> * Apply isort and black reformatting Signed-off-by: Laplasjan107 <[email protected]> * make style coherent Signed-off-by: Piotr Kaminski <[email protected]> --------- Signed-off-by: Piotr Kaminski <[email protected]> Signed-off-by: Laplasjan107 <[email protected]> Co-authored-by: Piotr Kaminski <[email protected]> Co-authored-by: Laplasjan107 <[email protected]>
Signed-off-by: Oliver Koenig <[email protected]>
Signed-off-by: Shashank Verma <[email protected]>
…, and training micro-optimizations (#11058) * Canary 2 prompt formatter Signed-off-by: Piotr Żelasko <[email protected]> * autoregister canary2 prompt format Signed-off-by: Piotr Żelasko <[email protected]> * lift the restrictions in canary tokenizer Signed-off-by: Piotr Żelasko <[email protected]> * work around canary tokenizer Signed-off-by: Piotr Żelasko <[email protected]> * unit test + fix number of issues Signed-off-by: Piotr Żelasko <[email protected]> * Add Canary2 and Canary1 language name back compat Signed-off-by: Piotr Żelasko <[email protected]> * 16% speedup Canary training on 1GPU Signed-off-by: Piotr Żelasko <[email protected]> * Apply isort and black reformatting Signed-off-by: pzelasko <[email protected]> * Fixes for Canary loss masking in train and val Signed-off-by: Piotr Żelasko <[email protected]> * Simplified language codes back to Canary1 format but expanded to all ISO lang codes Signed-off-by: Piotr Żelasko <[email protected]> * Canary <0.5s inference fix via padding Signed-off-by: Piotr Żelasko <[email protected]> * Add Lhotse issue workaround Signed-off-by: Piotr Żelasko <[email protected]> * Make loss masking on prompt optional and disabled by default Signed-off-by: Piotr Żelasko <[email protected]> * Fix copyright notices Signed-off-by: Piotr Żelasko <[email protected]> * Make pnc optional Signed-off-by: Piotr Żelasko <[email protected]> --------- Signed-off-by: Piotr Żelasko <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> Signed-off-by: pzelasko <[email protected]> Co-authored-by: pzelasko <[email protected]>
* first commit Signed-off-by: Chen Cui <[email protected]> * refactor lora Signed-off-by: Chen Cui <[email protected]> * refactor lora Signed-off-by: Chen Cui <[email protected]> * refactor dora Signed-off-by: Chen Cui <[email protected]> * add lora linear layers for qkv and fc1 Signed-off-by: Chen Cui <[email protected]> * add sharded state dict Signed-off-by: Chen Cui <[email protected]> * clean up Signed-off-by: Chen Cui <[email protected]> * doc strings Signed-off-by: Chen Cui <[email protected]> * refactor Signed-off-by: Chen Cui <[email protected]> * Apply isort and black reformatting Signed-off-by: cuichenx <[email protected]> * clean up Signed-off-by: Chen Cui <[email protected]> * Apply isort and black reformatting Signed-off-by: cuichenx <[email protected]> * doc strings Signed-off-by: Chen Cui <[email protected]> * Apply isort and black reformatting Signed-off-by: cuichenx <[email protected]> * add ci Signed-off-by: Chen Cui <[email protected]> * typo Signed-off-by: Chen Cui <[email protected]> * Apply isort and black reformatting Signed-off-by: cuichenx <[email protected]> --------- Signed-off-by: Chen Cui <[email protected]> Signed-off-by: cuichenx <[email protected]> Co-authored-by: cuichenx <[email protected]>
* ci: Run secrets detector on merge-commit Signed-off-by: Oliver Koenig <[email protected]> * fix Signed-off-by: Oliver Koenig <[email protected]> --------- Signed-off-by: Oliver Koenig <[email protected]>
* chore(beep boop 🤖): Bump `MCORE_TAG=67a50f2...` (2024-11-28) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Signed-off-by: Dennis <[email protected]> Signed-off-by: Victarry <[email protected]> Signed-off-by: Oliver Koenig <[email protected]> Co-authored-by: Zhenhuan Liu <[email protected]> Co-authored-by: Victarry <[email protected]> Co-authored-by: Dennis <[email protected]>
* Handle exception when importing RetroGPTChunkDatasets Signed-off-by: Guyue Huang <[email protected]> * Update baseline Signed-off-by: guyueh1 <[email protected]> * Revert "Update baseline" This reverts commit 21a4734. * Update baseline Signed-off-by: guyueh1 <[email protected]> * Revert "Update baseline" This reverts commit 8d35fcc. --------- Signed-off-by: Guyue Huang <[email protected]> Signed-off-by: guyueh1 <[email protected]> Co-authored-by: guyueh1 <[email protected]>
* add bert in nemo2 * Add bert loss * update bert with latest nemo * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> * Bert update * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> * import optimize * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> * pylint * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> * pylint * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> * pylint * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> * more comment * use lightning package * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> * comments resolved * Add loss * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> * Fix NaN loss when resume * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> * add default for num_tokentypes * update forward output to be a dict * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> * ReName to HuggingFace Bert Model * Add BertLarge * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> * Add BERT Tests * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> * typo * add exporter * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> * Fix Unit Tests * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> * add sig * rename pretraining dataset testing for bert * delete pretraining dataset testing for bert * resolve TE dependency * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> * resolve TE dependency * resolve TE dependency * resolve TE dependency * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> * resolve TE dependency * Apply isort and black reformatting Signed-off-by: suiyoubi <[email protected]> --------- Signed-off-by: suiyoubi <[email protected]> Co-authored-by: suiyoubi <[email protected]>
Previously if model_prefix wasn't part of model_state_dict, then the nemo_model_conversion_dict would contain keys that looked like `.decoder....` with an extra "." prefix. This just makes sure model_prefix isn't empty before adding the model_prefix in. Signed-off-by: Terry Kong <[email protected]>
…11471) * update restore from config for continual training Signed-off-by: yaoyu-33 <[email protected]> * update star coder yaml Signed-off-by: yaoyu-33 <[email protected]> --------- Signed-off-by: yaoyu-33 <[email protected]>
* Update dist_ckpt.rst with best practices Signed-off-by: FortunaZhang <[email protected]> * Update dist_ckpt.rst with best practices Signed-off-by: FortunaZhang <[email protected]> * Add files via upload Signed-off-by: FortunaZhang <[email protected]> * Update dist_ckpt.rst per reviewer suggestions Signed-off-by: FortunaZhang <[email protected]> * Update dist ckpt image source to Release assets Signed-off-by: FortunaZhang <[email protected]> * Update dist_ckpt.rst Signed-off-by: FortunaZhang <[email protected]> --------- Signed-off-by: FortunaZhang <[email protected]>
…ec (#11189) * fix: regular torch optims (e.g., sgd) no longer error with closure spec Signed-off-by: Terry Kong <[email protected]> * address style Signed-off-by: Terry Kong <[email protected]> * revert old pop Signed-off-by: Terry Kong <[email protected]> * Update nemo/core/optim/optimizer_with_main_params.py Co-authored-by: Ananth Subramaniam <[email protected]> Signed-off-by: Terry Kong <[email protected]> Signed-off-by: Terry Kong <[email protected]> * refine comment Signed-off-by: Terry Kong <[email protected]> --------- Signed-off-by: Terry Kong <[email protected]> Signed-off-by: Terry Kong <[email protected]> Co-authored-by: Ananth Subramaniam <[email protected]>
Signed-off-by: Oliver Koenig <[email protected]>
* Init with parallelism validation * Apply isort and black reformatting Signed-off-by: BoxiangW <[email protected]> * Add config checks * Apply isort and black reformatting Signed-off-by: BoxiangW <[email protected]> * fix * Add unit tests on configs validation * Add test * Add copyright * Change test * testing * Test --------- Signed-off-by: BoxiangW <[email protected]> Co-authored-by: BoxiangW <[email protected]>
* fix finetuning PP Signed-off-by: Chen Cui <[email protected]> * refactor to avoid setting a class attribute in a class property Signed-off-by: Chen Cui <[email protected]> * Fix CI test script Signed-off-by: Chen Cui <[email protected]> --------- Signed-off-by: Chen Cui <[email protected]>
Signed-off-by: Ante Jukić <[email protected]>
* config hierarchy Signed-off-by: Malay Nagda <[email protected]> * better formatting Signed-off-by: Malay Nagda <[email protected]> * structure change Signed-off-by: Malay Nagda <[email protected]> * structure change Signed-off-by: Malay Nagda <[email protected]> * correct link Signed-off-by: Malay Nagda <[email protected]> * location for config map Signed-off-by: Malay Nagda <[email protected]> --------- Signed-off-by: Malay Nagda <[email protected]>
…ather (#11486) * Add disable/enable forward pre hook for DDP and overlap param gather Signed-off-by: Hemil Desai <[email protected]> * Fix Signed-off-by: Hemil Desai <[email protected]> * Force param sync before saving checkpoint Signed-off-by: Hemil Desai <[email protected]> * fix Signed-off-by: Hemil Desai <[email protected]> * Apply isort and black reformatting Signed-off-by: hemildesai <[email protected]> --------- Signed-off-by: Hemil Desai <[email protected]> Signed-off-by: hemildesai <[email protected]> Co-authored-by: hemildesai <[email protected]>
github-actions
bot
added
ASR
NLP
Multi Modal
TTS
core
common
CI
Audio
Speaker Tasks
labels
Dec 7, 2024
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
What does this PR do ?
Add a one line overview of what this PR aims to accomplish.
Collection: [Note which collection this PR will affect]
Changelog
Usage
# Add a code snippet demonstrating how to use this
GitHub Actions CI
The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.
The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".
Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items you can still open "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.
Additional Information