Stars
Developer APIs to Accelerate LLM Projects
✨✨Latest Advances on Multimodal Large Language Models
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
Generative Agents: Interactive Simulacra of Human Behavior
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Grab top 10 github trending projects every day and save it to issues. You can subscribe by watching this repo or via RSS
Large Language Model Text Generation Inference
A high-throughput and memory-efficient inference and serving engine for LLMs
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
Fine-tune and quantize Llama-2-like models to generate Python code using QLoRA, Axolot,..
ImageBind One Embedding Space to Bind Them All
deepspeedai / Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LMOngoing research training transformer language models at scale, including: BERT & GPT-2
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Running large language models on a single GPU for throughput-oriented scenarios.
A Collection of BM25 Algorithms in Python
This sample provides a CDK project that allows you to deploy a serverless chat application based on API Gateway's WebSocket-based API feature.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Graphormer is a general-purpose deep learning backbone for molecular modeling.
Pretrained SMILES transformation model for finetuning for diverse molecular tasks.
Work related to the BioCreative CHEMDNER corpora
Using LSTM or Transformer to solve Image Captioning in Pytorch
AI and Machine Learning with Kubeflow, Amazon EKS, and SageMaker
Pytorch implementation of JointBERT: "BERT for Joint Intent Classification and Slot Filling"
slot filling, intent detection, joint training, ATIS & SNIPS datasets, the Facebook’s multilingual dataset, MIT corpus, E-commerce Shopping Assistant (ECSA) dataset, CoNLL2003 NER, ELMo, BERT, XLNet