From 130ed82c0befb0ee8caf65088113dbb05224da5d Mon Sep 17 00:00:00 2001
From: "dependabot[bot]" <49699333+dependabot[bot]@users.noreply.github.com>
Date: Tue, 25 Feb 2025 00:41:07 +0800
Subject: [PATCH] Bump transformers from 4.38.2 to 4.48.0 in /research/fed-bpt
(#3218)
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Bumps [transformers](https://github.com/huggingface/transformers) from
4.38.2 to 4.48.0.
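One way to confirm that an environment actually picked up the new pin is to compare the installed release against 4.48.0. The following is a minimal sketch, not part of the upstream change: the helper names are hypothetical, and it assumes purely numeric dotted versions (a pre-release suffix such as `.dev0` would need a real version parser).

```python
from importlib import metadata


def parse_release(version: str) -> tuple[int, ...]:
    """Turn '4.48.0' into (4, 48, 0). Assumes numeric dotted versions only."""
    return tuple(int(part) for part in version.split("."))


def meets_minimum(installed: str, minimum: str = "4.48.0") -> bool:
    """Return True when the installed release is at least the minimum release."""
    return parse_release(installed) >= parse_release(minimum)


def installed_transformers_ok(minimum: str = "4.48.0") -> bool:
    """Check the transformers distribution found in the current environment.

    Returns False when the package is not installed at all.
    """
    try:
        return meets_minimum(metadata.version("transformers"), minimum)
    except metadata.PackageNotFoundError:
        return False
```

Calling `installed_transformers_ok()` inside the /research/fed-bpt environment should return True once the bumped requirement has been installed.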
Sourced from transformers's releases.
The ModernBert model was proposed in Smarter, Better, Faster, Longer:
A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long
Context Finetuning and Inference by Benjamin Warner, Antoine
Chaffin, Benjamin Clavié, Orion Weller, Oskar Hallström, Said
Taghadouini, Alexis Gallagher, Raja Biswas, Faisal Ladhak, Tom Aarsen,
Nathan Cooper, Griffin Adams, Jeremy Howard and Iacopo Poli.
It is a refresh of the traditional encoder architecture, as used in
previous models such as BERT and RoBERTa. It builds on BERT and
implements many modern architectural improvements which have been
developed since its original release.
The Aria model was proposed in Aria: An Open Multimodal
Native Mixture-of-Experts Model by Li et al. from the Rhymes.AI
team. Aria is an open multimodal-native model with best-in-class
performance across a wide range of multimodal, language, and coding
tasks. It has a Mixture-of-Experts architecture, with 3.9B and 3.5B
activated parameters per visual token and text token, respectively.
Here's a general usage example:
checkpoint = "timm/resnet50.a1_in1k"
img = Image.open(urlopen(
    'https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/beignets-task-guide.png'
))
image_processor = AutoImageProcessor.from_pretrained(checkpoint)
Release notes
v4.48.0: ModernBERT, Aria, TimmWrapper, ColPali, Falcon3, Bamba,
VitPose, DinoV2 w/ Registers, Emu3, Cohere v2, TextNet, DiffLlama,
PixtralLarge, Moonshine
New models
ModernBERT
@warner-benjamin in #35158
Aria
@aymeric-roucher in #34157
TimmWrapper
We add a TimmWrapper set of classes such that timm
models can be loaded in as transformer models into the library.
import torch
from urllib.request import urlopen
from PIL import Image
from transformers import AutoConfig, AutoModelForImageClassification, AutoImageProcessor
... (truncated)
6bc0fbc [WIP] Emu3: add model (#33770)
59e28c3 Fix flex_attention in training mode (#35605)
7cf6230 push a fix for now
d6f446f when filtering we can't use the convert script as we removed them
8ce1e95 [test-all]
af2d7ca Add Moonshine (#34784)
42b8e79 ModernBert: reuse GemmaRotaryEmbedding via modular + Integration tests (#35459)
e39c9f7 v4.48-release
8de7b1b Add flex_attn to diffllama (#35601)
1e3ddcb ModernBERT bug fixes (#35404)