← Back to report

huggingface/transformers

All events — 2026-03-19

Type AI Score Description Actor Reason Date
COMMIT 1.00 Update AFMoE architecture to use v5-style MoE impl (#44063) Commit message contains explicit AI assi 2026-03-19
COMMIT 1.00 Sdpa for owlvit (#42136) Commit message contains explicit AI assi 2026-03-17
COMMIT 1.00 :rotating_light: Validate config attributes (#41250) Commit message contains explicit AI assi 2026-03-16
COMMIT 1.00 Fix off-by-one in decode_spans boundary check (#44584) Commit message contains explicit AI assi 2026-03-12
PR 1.00 Fix #44155: [AudioFlamingo3] Batched inference produces inco PR body explicitly mentions AI collabora 2026-02-21
COMMIT 0.00 Fix glm dsa (#44564) 2026-03-19
COMMIT 0.00 🚨🚨 Refactor Image Processors to support different backends ( 2026-03-19
COMMIT 0.00 [generate] Never use `cache_position` anymore in generation 2026-03-19
COMMIT 0.00 Fix KeyError in convert_to_native_format for dict vocab (#44 2026-03-19
COMMIT 0.00 fix: XLNet: relative_positional_encoding computes on CPU eve 2026-03-19
COMMIT 0.00 Fix annotations reader for python 3.14 in `PreTrainedModel`
neo
2026-03-19
COMMIT 0.00 [CB] Better parametrization for compile (#44578) 2026-03-19
COMMIT 0.00 Fix `KeyError` when patching mistral regex (#43376) 2026-03-19
COMMIT 0.00 Correct code block formatting in weightconverter.md (#44839) 2026-03-19
COMMIT 0.00 deepseek_v2, deepseek_v3, and modernbert fix for having inco 2026-03-18
COMMIT 0.00 [Model] Add PP-OCRv5_server_rec and PP-OCRv5_mobile_rec mod 2026-03-18
COMMIT 0.00 Add `Jina-Embeddings-V3` Model (#44251) 2026-03-18
COMMIT 0.00 feat(ci): added a network debug report (#44636) 2026-03-18
COMMIT 0.00 Add GreedyLR adaptive learning rate scheduler (#44271) 2026-03-18
COMMIT 0.00 Fix unexpected `position_ids` keys when loading OwlViT model 2026-03-18
COMMIT 0.00 Update more modular examples (#44834) 2026-03-18
COMMIT 0.00 Fix and re-run modular converter on examples (#44833) 2026-03-18
COMMIT 0.00 Remove cache_position in more models (4 and last one) (#4482 2026-03-18
COMMIT 0.00 Fix loading issue in Sam3 (#44831) 2026-03-18
COMMIT 0.00 feat(integration): Add KubeflowCallback to enable automatic 2026-03-18
COMMIT 0.00 Add GGUF support for MiniMax-M2.1 model (#44526) 2026-03-18
COMMIT 0.00 Centralize AI agent templates in `.ai` (#44489) 2026-03-18
COMMIT 0.00 support xxxFast alias in v5 tokenizers (#44766) 2026-03-18
COMMIT 0.00 Remove cache_position in more models (3) (#44759) 2026-03-18
COMMIT 0.00 Fix `supports_{tp/pp}_plan` (#44696) 2026-03-18
COMMIT 0.00 [CI] Temporarily skip Mistral4 tests as they almost all fail 2026-03-18
COMMIT 0.00 update flex attention to use `return_aux` instead of `return 2026-03-18
COMMIT 0.00 [Gemma] Update conversion scripts for Transformers v5 Comapt 2026-03-18
COMMIT 0.00 fix bug embedding_size mismatch with hidden_size in electra 2026-03-18
COMMIT 0.00 Fix pegasus conversion (#44571) 2026-03-18
COMMIT 0.00 Fix repo-check bot (#44812) 2026-03-18
COMMIT 0.00 [docs] is_causal feature (#44777) 2026-03-17
COMMIT 0.00 docs(tasks): remove references to removed question-answering 2026-03-17
COMMIT 0.00 Fix configs with `@strict` (#44770) 2026-03-17
COMMIT 0.00 [AMD CI] Fix test failures across important models (#44632) 2026-03-17
COMMIT 0.00 Move VLM conversions to the main mapping (#44627) 2026-03-17
COMMIT 0.00 Fix config loading issues (type issues) (#44789) 2026-03-17
COMMIT 0.00 Remove `is_causal` from `EuroBertConfig` (#44774) 2026-03-17
COMMIT 0.00 model-linter: Added rule 10 (#44761) 2026-03-17
COMMIT 0.00 [fix] mistral 4 docs (#44776) 2026-03-16
COMMIT 0.00 Add Mistral 4 (#44760) 2026-03-16
COMMIT 0.00 Fix: Eurobert model was missing @strict decorator and invali 2026-03-16
COMMIT 0.00 fix: sig lip import (#44764) 2026-03-16
COMMIT 0.00 Disable async loading when quantizing on the fly (#44576) 2026-03-16
COMMIT 0.00 Bump torchao >=0.15 and fix quantization CI (#44604) 2026-03-16
COMMIT 0.00 Fix tensor indexing crash in serve generate_response KV cach 2026-03-16
COMMIT 0.00 [MistralCommonBackend] Upgrade mistral-common to v1.10.0 (#4 2026-03-16
COMMIT 0.00 Fix `mlcd` auto config/model/mapping issues (#44730) 2026-03-16
COMMIT 0.00 Fix bug and add XPU Expectations for qwen2 and jamba tests ( 2026-03-16
COMMIT 0.00 Add model lerobot PI0 to transformers (#44160) 2026-03-16
COMMIT 0.00 [medasr] doc update (#44633) 2026-03-16
COMMIT 0.00 Idefics3 without cache fix (#44607) 2026-03-16
COMMIT 0.00 Add XPU Expectations for vibe voice acoustic tokenizer tests 2026-03-16
COMMIT 0.00 Fix transformers serve's 422 unprocessable entity (#44620) 2026-03-16
COMMIT 0.00 Fix missing / incorrect `config` class in some model class d 2026-03-15
COMMIT 0.00 Update Nvidia CI docker file to use torch 2.10 (#44712) 2026-03-14
COMMIT 0.00 [`FA`] Fix fa detection (#44703) 2026-03-14
COMMIT 0.00 Fix `set_encoder` (#44698) 2026-03-14
COMMIT 0.00 [docs] cb config (#44675) 2026-03-13
COMMIT 0.00 Fix more model tester missing `parent` issue (#44685) 2026-03-13
COMMIT 0.00 :rotating_light: [`FA4`] Initial support (#42435) 2026-03-13
COMMIT 0.00 Add register method for `ParallelInterface` (#44640) 2026-03-13
COMMIT 0.00 [CB] [Bug] Fix crashes when running without cuda (#44673) 2026-03-13
COMMIT 0.00 Another (small) set of fixes required for tiny model creatio 2026-03-13
COMMIT 0.00 Fix CookieCutter (#44334) 2026-03-13
COMMIT 0.00 Fix AWQ tests for GPTQModel migration (#44654) 2026-03-13
COMMIT 0.00 [Model] Add PP-OCRV5_mobile_det Model Support (#43247) 2026-03-13
COMMIT 0.00 pipelines do not have modelcard (#44621) 2026-03-13
COMMIT 0.00 [`Chmv2`] Fix conversion after capture refactor (#44665) 2026-03-13
COMMIT 0.00 fix(models, testing): Fix Llama4 vision rotary meta tensor i 2026-03-13
COMMIT 0.00 [CB] Add dedicated config (#44434) 2026-03-13
COMMIT 0.00 fix(models): Forward timm model kwargs to timm.create_model 2026-03-13
COMMIT 0.00 Ensure same `dtype` for subconfig when `_from_config` (#4462 2026-03-13
COMMIT 0.00 Remove `cache_position` in more models (2) (#44602) 2026-03-12
COMMIT 0.00 fix: cast to proper dtype in EmbeddingParallel (#44612) 2026-03-12
COMMIT 0.00 Allow to disable stdout hiding for TP (#44608) 2026-03-12
COMMIT 0.00 Remove many output_attentions and other traced outputs on 10 2026-03-12
COMMIT 0.00 [Model] Add PP-OCRV5_server_det Model Support (#43274) 2026-03-12
COMMIT 0.00 fix: raise error if mm_token_type_ids not supplied (#44433) 2026-03-12
COMMIT 0.00 Fix output capturing for Backbones (#44638) 2026-03-12
COMMIT 0.00 Fix lfm2 kernel path (#44634) 2026-03-12
COMMIT 0.00 Fix for `VibeVoiceAcousticTokenizer` (#44628) 2026-03-12
COMMIT 0.00 Add an integration test for LASR using pipe and chunked deco
kho
2026-03-12
COMMIT 0.00 Fix more wrong HF hub checkpoint names (#44624) 2026-03-12
COMMIT 0.00 Update agentic contributions guidelines in AGENTS.md to forc 2026-03-12
COMMIT 0.00 Expand model-structure lint rules with a fast AST-based, ruf 2026-03-12
COMMIT 0.00 feat: add neuron in tensor parallelism initialization (#4449 2026-03-11
COMMIT 0.00 [WIP] FIX Make Mixtral LoRA loading work (#44478) 2026-03-11
COMMIT 0.00 Fix Llava tests for torch too! (#44476) 2026-03-11
COMMIT 0.00 Fix training ci and clean some tests (#44491) 2026-03-11
COMMIT 0.00 Add CHMv2 (#44595) 2026-03-11
COMMIT 0.00 Remove useless identity assignment (#44600) 2026-03-11
COMMIT 0.00 Add Yoni to run-slow workflow (#44598) 2026-03-11
COMMIT 0.00 Add shared VLM tests (#42964) 2026-03-11
COMMIT 0.00 Fix wrong (non-existing) checkpoints (#44549) 2026-03-11
COMMIT 0.00 Remove `cache_position` in more models (#44330) 2026-03-11
PR 0.00 Switch FP8 per tensor quant to use `torch._scaled_mm` 2026-03-19
PR 0.00 DeepGEMM 2026-03-18
PR 0.00 Update some type hints 2026-03-19
PR 0.00 Proposal to add Qwen3-ASR support [WIP] 2026-02-08
PR 0.00 [Model] Add PP-Chart2Table Model Support 2026-02-05
PR 0.00 Dequant fix 2026-03-18
PR 0.00 [Model] Add SLANeXt Model Support 2026-02-03
PR 0.00 🚨 Refactor ViT to updated standards 2025-10-17
PR 0.00 Add THD support in ESM 2026-02-19
PR 0.00 [Model] Add UVDoc Model Support 2026-01-21
PR 0.00 feat: added cache to the model linter 2026-03-17
PR 0.00 Propagate the model loading from transformers serve to chat 2026-03-16
PR 0.00 chore(typing): extend typing to `src/transformers/cli` 2026-03-10
PR 0.00 Fix core dumped when `NemotronH` is torch compiled 2026-03-19
PR 0.00 Officially launch parse_response 2026-03-13
PR 0.00 [CB] Add an option to return logprobs 2026-03-18
PR 0.00 fix: handle list-type _tied_weights_keys in _get_tied_weight 2026-03-19
PR 0.00 Fix glm dsa 2026-03-10
PR 0.00 [PoC] HF exporters 2025-11-03
PR 0.00 [Mistral] Fix query scaling for Mistral4 and Ministral3 2026-03-19
PR 0.00 Fix several based models' pipeline parallel support 2026-03-14
PR 0.00 Support Modular (!!) + Configs in `check_auto_docstrings` 2026-03-17
PR 0.00 Fix failing `Qwen3OmniModelIntegrationTests` 2026-03-19
PR 0.00 🚨🚨 Refactor Image Processors to support different backends 2026-01-27
PR 0.00 Dynamic weight conversion is recursive 2026-02-26
PR 0.00 FSDP2 native support in transformers 2026-02-17
PR 0.00 [generate] Never use `cache_position` anymore in generation 2026-03-18
PR 0.00 add HyperClovaX Vision 2026-02-27
PR 0.00 perceptron: Isaac-0.1 implementation 2025-09-18
PR 0.00 refactor: rope in model, flatten vision, rely on qwen3 backo 2026-03-19
PR 0.00 enable tp for benchmark 2026-02-05
PR 0.00 Update AFMoE architecture to use v5-style MoE impl 2026-02-17
PR 0.00 Fix KeyError in convert_to_native_format for dict vocab 2026-03-05
PR 0.00 Use `index_select` instead of advanced indexing in `batched_ 2026-03-13
PR 0.00 fix: XLNet: relative_positional_encoding computes on CPU eve 2026-03-17
PR 0.00 Fix annotations reader for python 3.14 in `PreTrainedModel`
neo
2026-03-13
PR 0.00 fix: allow AutoImageProcessor to load from URL 2026-03-18
PR 0.00 Add Music Flamingo 2026-01-27
PR 0.00 [CB] [Minor] Simplify test suite 2026-03-19
PR 0.00 fix(testing): Fix PaliGemma 2 and PaddleOCR-VL test failures 2026-03-16
PR 0.00 fix: Add MXFP4 MoE/attention backward kernels 2026-02-05
PR 0.00 fix: handle unpicklable tokenizers in ProcessorMixin.to_dict 2026-03-19
PR 0.00 deepseek_v2, deepseek_v3, and modernbert fix for having inco 2026-03-17
PR 0.00 fix: move comments before @torch.jit.script decorator for Py 2026-03-19
PR 0.00 Fix DEIM config export and public API 2026-03-19
PR 0.00 Add /v1/completions endpoint (OpenAI legacy completions API) 2026-03-10
PR 0.00 [Misc] add enable_thinking to template kwargs 2026-03-18
PR 0.00 model: Add DEIMv2 to Transformers 2026-02-27
PR 0.00 Add xcodec2 model 2026-02-20
PR 0.00 [`Mllama`] Fix workaround compile 2026-03-19
PR 0.00 Fix Zamba2MambaMixer ignoring use_mamba_kernels=False 2026-03-19
PR 0.00 Fix AutoImageProcessor URL loading regression 2026-03-19
PR 0.00 Goodbye cache position 2026-03-13
PR 0.00 [CB] Better parametrization for compile 2026-03-10
PR 0.00 Allow kernel modules to declare their preferred mask functio 2026-03-13
PR 0.00 [Model] Add PP-OCRV5_mobile_rec Model Support 2026-02-06
PR 0.00 Fix AutoImageProcessor.from_pretrained failing with URL inpu 2026-03-18
PR 0.00 Fix whisper return language 2025-11-16
PR 0.00 fix(flaky): use a fixture for `set_seed` and single-threadin 2026-02-07
PR 0.00 Add `Jina-Embeddings-V3` Model 2026-02-24
PR 0.00 [docs] training on specific hardware 2026-03-17
PR 0.00 Fix `AutoImageProcessor` to correctly detect local implement 2026-03-13
PR 0.00 Use doc-builder runnable example for GLM-ASR 2026-02-25
PR 0.00 Fix Mllama torch.compile failure caused by new attention mas 2026-03-19
PR 0.00 Fix `KeyError` when patching mistral regex 2026-01-20
PR 0.00 ci: add anti-slop action 2026-03-19
PR 0.00 Correct code block formatting in weightconverter.md 2026-03-19
PR 0.00 [Docs] Update DeiT model card to new format 2026-03-19
PR 0.00 Fix llama4 bnb mode 2026-03-11
PR 0.00 Add cu_seqlens support to OlmoHybridGatedDeltaNet for packed 2026-03-18
PR 0.00 Internalise the NomicBERT model 2025-12-29
PR 0.00 [docs] optimizers, hyperparam search, training features 2026-02-26
PR 0.00 [docs] model cards 2026-03-18
PR 0.00 Fix Mistral4 tests 2026-03-18
PR 0.00 [Model] Add PP-OCRv5_server_rec and PP-OCRv5_mobile_rec mod 2026-03-18
PR 0.00 small cleaning of quantization class 2025-12-04
PR 0.00 feat(ci): added a network debug report 2026-03-12
PR 0.00 Add GreedyLR adaptive learning rate scheduler 2026-02-25
PR 0.00 Fix unexpected `position_ids` keys when loading OwlViT model 2026-03-06
PR 0.00 Add Mistral 4 2026-03-16
PR 0.00 Add `base_model_tp_plan` to `OlmoeConfig` 2026-03-13
PR 0.00 Update more modular examples 2026-03-18
PR 0.00 fix(gpt2): Resolve NaN/Inf issue in lm_head on Python 3.13 w 2026-03-13
PR 0.00 Fix and re-run modular converter on examples 2026-03-18
PR 0.00 [Model] Add PP-OCRv5_server_rec Model Support 2026-02-06
PR 0.00 fix: add Float8 dtype fallback in modeling_utils.py 2026-03-11
PR 0.00 Remove cache_position in more models (4 and last one) 2026-03-18
PR 0.00 docs(pipelines): remove outdated question-answering example 2026-03-17
PR 0.00 Fix loading issue in Sam3 2026-03-18
PR 0.00 docs(quicktour): remove question-answering pipeline from qui 2026-03-18
PR 0.00 fix: handle dict vocab in CamembertTokenizer for tokenizer.j 2026-03-17
PR 0.00 Add MPS (Apple Silicon) example and documentation 2026-03-17
PR 0.00 fix: Cache XLNet relative_positional_encoding to avoid CPU c 2026-03-16
PR 0.00 fix: resolve false-positive regex warning for non-mistral mo 2026-03-16
PR 0.00 Fix: propagate interpolate_pos_encoding through PixioEmbeddi 2026-03-15
PR 0.00 feat(integration): Add KubeflowCallback to enable automatic 2026-03-06
PR 0.00 Add AudioFlamingoNext model 2026-03-18
PR 0.00 fix series of failed test case for janus model 2026-03-16
PR 0.00 Add GGUF support for MiniMax-M2.1 model 2026-03-08