| # | Title | Categories |
| --- | --- | --- |
| 3 | NVIDIA and Scaleway Speed Development for European Startups and Enterprises | [Model Deployment on Cloud] |
| 5 | How Amazon and NVIDIA Help Sellers Create Better Product Listings With AI | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 9 | Ray Shines with NVIDIA AI: Anyscale Collaboration to Help ... | [Model Serving and Scaling] |
| 10 | At Your Microservice: NVIDIA Smooths Businesses' Journey to ... | [Model Deployment on Cloud] |
| 11 | LLMs Land on Laptops: NVIDIA, HP CEOs Celebrate AI PCs | [Model Deployment on Local] |
| 15 | NVIDIA Expands Robotics Platform to Meet the Rise of Generative AI | [Model Deployment on Local] |
| 19 | Google's Gemma Optimized Across All NVIDIA AI Platforms \| NVIDIA ... | [Model Deployment on Cloud] [Model Deployment on Local] |
| 22 | NVIDIA Grace Hopper Superchip Sweeps MLPerf Inference Benchmarks | [Model Serving and Scaling] |
| 29 | New Class of Accelerated, Efficient AI Systems Mark the Next Era of Supercomputing | [Model Serving and Scaling] |
| 31 | NVIDIA BioNeMo Enables Generative AI for Drug Discovery on AWS | [Model Deployment on Cloud] |
| 37 | NVIDIA Advances Accelerated Computing, Generative AI at AWS re:Invent | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 40 | KServe Providers Offering NIM Inference in Clouds and Data ... | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 41 | NVIDIA and Google Cloud Collaborate to Accelerate AI Development | [Model Deployment on Cloud] |
| 43 | TOPS of the Class: Decoding AI Performance on RTX AI PCs and Workstations | [Model Serving and Scaling] |
| 45 | Mistral AI and NVIDIA Unveil Mistral NeMo 12B, a Cutting-Edge Enterprise AI Model | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 55 | Decoding NIM Microservices That Accelerate Generative AI \| NVIDIA ... | [Model Serving and Scaling] [Model Deployment on Cloud] [Model Deployment on Local] |
| 73 | Singtel, NVIDIA to Bring Sovereign AI to Southeast Asia \| NVIDIA Blogs | [Model Deployment] |
| 75 | How Developers Can Construct the Future of Generative AI at Microsoft Build 2024 | [Model Deployment on Cloud] |
| 77 | NVIDIA AI Microservices for Drug Discovery, Digital Health Now Integrated With AWS | [Model Deployment on Cloud] |
| 78 | NVIDIA Collaborates With Microsoft to Help Developers Build ... | [Model Deployment on Cloud] |
| 83 | NVIDIA Teams With Google DeepMind to Drive LLM Innovation ... | [Model Deployment on Cloud] |
| 89 | Unlocking AI for Enterprises: Join NVIDIA at Oracle CloudWorld | [Model Deployment on Cloud] |
| 92 | Large Language Models up to 4x Faster on RTX With TensorRT-LLM for Windows | [Model Serving and Scaling] |
| 93 | NVIDIA and Alphabet's Intrinsic Put Next-Gen Robotics Within Grasp ... | [Model Deployment on Local] |
| 98 | Generative AI's Journey to Production Unveiled at Google Cloud ... | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 100 | NVIDIA TensorRT-LLM Supercharges Large Language Model Inference on NVIDIA H100 GPUs | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 107 | Deploying Retrieval-Augmented Generation Applications on NVIDIA GH200 Delivers Accelerated Performance | [Model Deployment on Cloud] |
| 111 | NVIDIA NIM Offers Optimized Inference Microservices for Deploying AI Models at Scale | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 113 | Supercharging LLM Applications on Windows PCs with NVIDIA RTX Systems | [Model Deployment on Local] |
| 114 | Personalized Learning with Gipi, NVIDIA TensorRT-LLM, and AI Foundation Models | [Model Serving and Scaling] |
| 115 | Power Your Business with NVIDIA AI Enterprise 4.0 for Production-Ready Generative AI | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 116 | Achieving High Mixtral 8x7B Performance with NVIDIA H100 Tensor Core GPUs and NVIDIA TensorRT-LLM | [Model Serving and Scaling] |
| 118 | Demystifying AI Inference Deployments for Trillion Parameter Large Language Models | [Model Serving and Scaling] |
| 119 | Achieving Top Inference Performance with the NVIDIA H100 Tensor Core GPU and NVIDIA TensorRT-LLM | [Model Serving and Scaling] |
| 120 | How to Take a RAG Application from Pilot to Production in Four Steps | [Model Deployment on Cloud] |
| 122 | Build Enterprise-Grade AI with NVIDIA AI Software \| NVIDIA ... | [Model Deployment on Cloud] [Model Monitoring] |
| 123 | NVIDIA H200 Tensor Core GPUs and NVIDIA TensorRT-LLM Set MLPerf LLM Inference Records | [Model Serving and Scaling] [Model Compression] [Model Deployment on Cloud] |
| 125 | Get Started with Generative AI Development for Windows PCs with NVIDIA RTX | [Model Compression] |
| 129 | Leading MLPerf Inference v3.1 Results with NVIDIA GH200 Grace Hopper Superchip Debut | [Model Deployment on Cloud] |
| 130 | NVIDIA GB200 NVL72 Delivers Trillion-Parameter LLM Training and Real-Time Inference | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 131 | Production-Ready, Enterprise-Grade Software on NVIDIA IGX Platform, Support for NVIDIA RTX 6000 Ada, and More | [Model Deployment on Local] [Model Deployment] |
| 134 | Optimizing Inference on Large Language Models with NVIDIA TensorRT-LLM, Now Publicly Available | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 139 | Deploy Large Language Models at the Edge with NVIDIA IGX Orin Developer Kit | [Model Deployment on Local] [Model Compression] |
| 141 | Writer Releases Domain-Specific LLMs for Healthcare and Finance | [Model Deployment on Cloud] |
| 143 | Advancing Security for Large Language Models with NVIDIA GPUs and Edgeless Systems | [Model Deployment on Cloud] |
| 144 | NVIDIA H100 System for HPC and Generative AI Sets Record for Financial Risk Calculations | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 148 | NVIDIA TensorRT-LLM Enhancements Deliver Massive Large Language Model Speedups on NVIDIA H200 | [Model Serving and Scaling] |
| 149 | NVIDIA Collaborates with Hugging Face to Simplify Generative AI Model Deployments | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 151 | Advancing Production AI with NVIDIA AI Enterprise \| NVIDIA ... | [Model Monitoring] |
| 153 | Accelerate Generative AI Inference Performance with NVIDIA TensorRT Model Optimizer, Now Publicly Available | [Model Compression] [Model Serving and Scaling] |
| 160 | Turbocharging Meta Llama 3 Performance with NVIDIA TensorRT-LLM and NVIDIA Triton Inference Server | [Model Deployment on Cloud] |
| 164 | Elevate Enterprise Generative AI App Development with NVIDIA AI on Azure Machine Learning | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 165 | Join the First NVIDIA LLM Developer Day: Elevate Your App-Building Skills | [Model Deployment on Cloud] |
| 168 | NVIDIA AI Foundation Models: Build Custom Enterprise Chatbots and Co-Pilots with Production-Ready LLMs | [Model Deployment on Cloud] |
| 173 | Bringing Generative AI to Life with NVIDIA Jetson \| NVIDIA Technical ... | [Model Serving and Scaling] |
| 181 | NVIDIA TensorRT-LLM Revs Up Inference for Google Gemma | [Model Serving and Scaling] |
| 184 | A Simple Guide to Deploying Generative AI with NVIDIA NIM | [Model Deployment on Cloud] |
| 186 | One Giant Superchip for LLMs, Recommenders, and GNNs: Introducing NVIDIA GH200 NVL32 | [Model Deployment on Cloud] |
| 187 | Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities | [Model Serving and Scaling] |
| 194 | Bringing Generative AI to the Edge with NVIDIA Metropolis Microservices for Jetson | [Model Deployment on Local] |
| 203 | Building Meta's GenAI Infrastructure - Engineering at Meta | [Model Deployment on Cloud] |
| 204 | How Meta trains large language models at scale - Engineering at Meta | [Model Deployment on Cloud] |
| 206 | How Meta is creating custom silicon for AI - Engineering at Meta | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 207 | Maintaining large-scale AI capacity at Meta - Engineering at Meta | [Model Serving and Scaling] [Model Monitoring] |
| 216 | Taming the tail utilization of ads inference at Meta scale ... | [Model Deployment on Cloud] |
| 306 | More-efficient recovery from failures during large-ML-model training | [Model Serving and Scaling] |
| 325 | Accelerating the next wave of generative AI startups \| AWS Startups ... | [Model Deployment on Cloud] |
| 330 | Unlocking Innovation: AWS and Anthropic push the boundaries of generative AI together | [Model Deployment on Cloud] |
| 334 | Why purpose-built artificial intelligence chips may be key to your generative AI strategy | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 351 | A secure approach to generative AI with AWS \| AWS Machine ... | [Model Deployment on Cloud] |
| 352 | Build an internal SaaS service with cost and usage tracking for foundation models on Amazon Bedrock | [Model Serving and Scaling] [Model Monitoring] |
| 357 | AWS Healthcare Customers Announce New Generative AI-Powered Solutions at HIMSS 2024 | [Model Deployment on Cloud] |
| 362 | Designing generative AI workloads for resilience \| AWS Machine ... | [Model Serving and Scaling] |
| 382 | Optimize price-performance of LLM inference on NVIDIA GPUs using the Amazon SageMaker integration with NVIDIA NIM Microservices | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 396 | eSentire delivers private and secure generative AI interactions to customers with Amazon SageMaker | [Model Deployment on Cloud] |
| 406 | Improve Amazon Bedrock Observability with Amazon CloudWatch AppSignals | [Model Monitoring] |
| 444 | Mixed-input matrix multiplication performance optimizations | [Model Serving and Scaling] |
| 467 | Advances in private training for production on-device language models | [Model Deployment on Local] |
| 468 | Computer-aided diagnosis for lung cancer screening | [Model Deployment on Cloud] |
| 476 | MobileDiffusion: Rapid text-to-image generation on-device | [Model Deployment on Local] |
| 521 | Google Cloud Next 2024: Gemini and generative AI updates | [Model Deployment on Cloud] |
| 532 | Google: Gemini API, Imagen 2, Duet AI and more updates | [Model Deployment on Cloud] |
| 533 | Google I/O 2024: Sundar Pichai on Gemini, AI progress and more | [Model Deployment on Cloud] |
| 539 | 5 highlights from Google Cloud Next 2024 | [Model Deployment on Cloud] |
| 603 | AI Edge Torch Generative API for Custom LLMs on Device - Google ... | [Model Deployment on Local] |
| 605 | AI Edge Torch: High Performance Inference of PyTorch Models on Mobile Devices | [Model Deployment on Local] |
| 607 | Model Explorer: Simplifying ML models for Edge devices - Google ... | [Model Monitoring] |
| 619 | the world's largest distributed LLM training job on TPU v5e \| Google ... | [Model Serving and Scaling] |
| 622 | Accelerating AI Inference with Google Cloud TPUs and GPUs ... | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 624 | Unlock AI anywhere with Google Distributed Cloud \| Google Cloud ... | [Model Deployment on Local] [Model Serving and Scaling] [Model Deployment on Cloud] |
| 627 | How Cloud TPU v5e accelerates large-scale AI inference \| Google ... | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 638 | What's new with Google Cloud's AI Hypercomputer architecture ... | [Model Deployment on Cloud] |
| 650 | Performance per dollar of GPUs and TPUs for AI inference \| Google ... | [Model Serving and Scaling] |
| 655 | Introducing Cloud TPU v5p and AI Hypercomputer \| Google Cloud ... | [Model Deployment on Cloud] |
| 659 | Google in The Forrester Wave AI Infrastructure Solutions, Q1 2024 ... | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 660 | RAG quickstart with Ray, LangChain, and HuggingFace \| Google ... | [Model Deployment on Cloud] |
| 669 | New localllm lets you develop gen AI apps locally, without GPUs ... | [Model Deployment on Cloud] [Model Monitoring] [Model Serving and Scaling] |
| 701 | Cost-efficient AI inference with Cloud TPU v5e on GKE \| Google ... | [Model Deployment on Cloud] |
| 704 | How Google Cloud is bringing Gemini to organizations everywhere ... | [Model Deployment on Cloud] |
| 708 | The overwhelmed person's guide to Google Cloud \| Google Cloud ... | [Model Deployment] |
| 721 | IBM Contributions at PyTorch Conference 2023 - IBM Developer | [Model Deployment on Cloud] |
| 837 | What is AI inferencing? - IBM Research | [Model Serving and Scaling] [Model Compression] |
| 841 | Why larger LLM context windows are all the rage - IBM Research | [Model Deployment on Cloud] |
| 843 | The future of AI is open - IBM Research | [Model Serving and Scaling] |
| 849 | New analog AI chip design uses much less power for AI tasks - IBM ... | [Model Compression] |
| 858 | Semantic Kernel: Local LLMs Unleashed on Raspberry Pi 5 | [Model Deployment on Local] |
| 861 | Introducing NVIDIA Nemotron-3 8B LLMs on the Model Catalog | [Model Deployment on Cloud] |
| 862 | SemanticKernel – Chat Service demo running Llama2 LLM locally in ... | [Model Deployment on Local] |
| 863 | Fundamentals of Deploying Large Language Model Inference | [Model Serving and Scaling] |
| 865 | Build, benchmark, evaluate and deploy real-time inference endpoint with Prompt Flow | [Model Deployment on Cloud] |
| 869 | Path to Production Azure OpenAI Instances - Education | [Model Monitoring] [Model Serving and Scaling] |
| 879 | Welcoming Mistral, Phi, Jais, Code Llama, NVIDIA Nemotron, and more to the Azure AI Model Catalog | [Model Deployment on Cloud] |
| 880 | Microsoft and Hugging Face deepen generative AI partnership | [Model Deployment on Cloud] |
| 882 | The LLM Latency Guidebook: Optimizing Response Times for GenAI Applications | [Model Serving and Scaling] |
| 899 | Enabling satellite operators to offer AI at the edge in space | [Model Deployment on Local] |
| 914 | Unlocking the power of NPU on Surface: Our “Hello World” journey | [Model Deployment on Local] [Model Compression] |
| 915 | Learn how to power your AI transformation with the Microsoft Cloud at NVIDIA GTC. | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 916 | Optimizing Azure OpenAI: A Guide to Limits, Quotas, and Best Practices | [Model Serving and Scaling] [Model Monitoring] |
| 919 | Microsoft at Supercomputing 2023 | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 920 | Strategies for Optimizing High-Volume Token Usage with Azure OpenAI | [Model Serving and Scaling] [Model Monitoring] |
| 927 | Azure OpenAI Service Launches GPT-4 Turbo and GPT-3.5-Turbo-1106 Models | [Model Deployment on Cloud] |
| 928 | Deploy your Azure Machine Learning prompt flow on virtually any platform | [Model Deployment on Cloud] |
| 932 | What runs GPT-4o and Microsoft Copilot? \| Largest AI supercomputer in the cloud \| Mark Russinovich | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 952 | Microsoft showcases latest AI solutions at NVIDIA GTC | [Model Deployment on Cloud] |
| 981 | Microsoft and G42 partner to accelerate AI innovation in UAE and beyond | [Model Deployment on Cloud] |
| 992 | Startups to access high-performance Azure infrastructure, accelerating AI breakthroughs | [Model Deployment on Cloud] |
| 1063 | Delivering Cutting-Edge AI Solutions to US Government - Azure ... | [Model Deployment on Cloud] |
| 1099 | Terminal Chat in Windows Terminal Canary - Windows Command ... | [Model Deployment on Local] |
| 1106 | Image to Text with Semantic Kernel and HuggingFace \| Semantic ... | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 1154 | LLM profiling guides KV cache optimization - Microsoft Research | [Model Compression] |
| 1160 | Microsoft at ASPLOS 2024: Advancing hardware and software for high-scale, secure, and efficient modern applications | [Model Deployment on Cloud] |
| 1170 | Splitwise improves GPU usage by splitting LLM inference phases | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 1179 | Skeleton-of-Thought: Parallel decoding speeds up and improves LLM output | [Model Serving and Scaling] |
| 1181 | Research Focus: Week of April 15, 2024 - Microsoft Research | [Model Serving and Scaling] |
| 1200 | Research Focus: Week of September 25, 2023 - Microsoft Research | [Model Serving and Scaling] |
| 1224 | Efficient and hardware-friendly neural architecture search with SpaceEvo | [Model Compression] |
| 1267 | Now available: starter kit for genAI on SAP BTP - SAP Community | [Model Deployment on Cloud] |
| 1300 | Secure your LLM: Consuming SAP Generative AI deployments in a Simple Python App - SAP ... | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 1346 | Early LLM serving experience and performance results with AMD Instinct MI300X GPUs | [Model Deployment on Cloud] |
| 1355 | Democratizing Generative AI with CPU-based Inference | [Model Compression] [Model Serving and Scaling] [Model Monitoring] |
| 1362 | Deploy LangChain applications as OCI model deployments | [Model Deployment on Cloud] |
| 1363 | Deploy Falcon-7B with NVIDIA TensorRT-LLM on OCI | [Model Deployment on Cloud] |
| 1365 | Bridging cloud and conversational AI: LangChain and OCI Data Science platform | [Model Deployment on Cloud] |
| 1373 | Exadata System Software 24ai - Delivers mission critical AI at any scale | [Model Serving and Scaling] |
| 1374 | Serving LLM using HuggingFace and Kubernetes on OCI - Part II | [Model Deployment on Cloud] |
| 1375 | Serving LLMs using HuggingFace and Kubernetes on OCI | [Model Deployment on Cloud] |
| 1377 | The Future of Generative AI: What Enterprises Need to Know | [Model Deployment on Cloud] |
| 1380 | Bring your own model to OCI Data Science AI Quick Actions | [Model Deployment on Cloud] |
| 1391 | Deploying ELYZA with vLLM and OCI Data Science | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 1394 | OCI with NVIDIA A100 Tensor Core GPUs for HPC and AI sets risk calculation records in financial services | [Model Serving and Scaling] |
| 1397 | Ampere Computing and Wallaroo.AI expand advanced AI options to OCI | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 1402 | How to Run NVIDIA NeMo on Oracle Cloud Infrastructure | [Model Deployment on Cloud] |
| 1403 | Practical inferencing of open source models on mainstream GPU-accelerated OCI servers | [Model Deployment on Cloud] [Model Compression] [Model Serving and Scaling] |
| 1413 | Speeding into the future: How SQream and Oracle catalyze rapid AI innovation | [Model Deployment on Cloud] |
| 1414 | John Snow Labs chooses OCI to deploy its AI medical chatbot | [Model Deployment on Cloud] |
| 1419 | AI and the Enterprise: Oracle's New Capabilities for Driving ... | [Model Deployment on Cloud] |
| 1420 | MLPerf Training Benchmark 4.0 Results on OCI GPU Superclusters | [Model Serving and Scaling] |
| 1423 | Enhancing OCI Data Science: Unveiling the New Autoscaling Feature for Model Deployment | [Model Deployment on Cloud] |
| 1437 | Machine learning enhanced real time fraud detection on OCI with NVIDIA Triton Inference Server | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 1448 | Building Data Center Infrastructure for the AI Revolution - Cisco Blogs | [Model Deployment on Cloud] |
| 1471 | Operational Innovations for AI and Cloud-Native Workloads from Cisco and Red Hat | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 1473 | An In-Depth Look at the Cisco CCDE-AI Infrastructure Certification | [Model Deployment on Cloud] |
| 1556 | Train Your Own LLM or Use an Existing One? \| Salesforce | [Model Deployment on Cloud] |
| 1730 | Power-efficient acceleration for large language models – Qualcomm Cloud AI SDK | [Model Deployment on Cloud] |
| 1731 | Train anywhere, Infer on Qualcomm Cloud AI 100 | [Model Serving and Scaling] |
| 1734 | AI workloads with Windows on Snapdragon | [Model Deployment on Local] |
| 1735 | Bare-metal, Hardware-Accelerated AI for Windows Apps Using ONNX RT | [Model Deployment on Cloud] |
| 1736 | Give your Hybrid AI the edge with Windows on Snapdragon | [Model Deployment on Local] [Model Serving and Scaling] |
| 1737 | How to Quadruple LLM Decoding Performance with Speculative Decoding (SpD) and Microscaling (MX) Formats on Qualcomm® Cloud AI 100 | [Model Serving and Scaling] |
| 1740 | Microsoft Build 2024 – Unleashing the potential of AI with Windows on Snapdragon | [Model Serving and Scaling] |
| 1742 | How to run a Large Language Model (LLM) on your AMD Ryzen™ AI PC or Radeon Graphics Card - AMD ... | [Model Deployment on Local] [Model Serving and Scaling] |
| 1743 | Supercharge Your LLMs with AMD Instinct™ MI300X Accelerators and ROCm™ Software - AMD ... | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 1744 | Reduce Memory Footprint and Improve Performance Running LLMs on AMD Ryzen™ AI and Radeon™ Platforms | [Model Compression] |
| 1745 | How Infinigence Provides Fast Generative AI Acceleration Solutions on AMD GPUs - AMD ... | [Model Compression] [Model Serving and Scaling] |
| 1749 | Llama 3.1: Ready to Run on AMD platforms from data center, edge to AI PCs - AMD ... | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 1750 | Developer Blog: Build a Chatbot with Ryzen™ AI Processors | [Model Compression] [Model Deployment on Local] |
| 1754 | New AMD ROCm™ 6.1 Software for Radeon™ Release Offers More Choices to AI Developers - AMD ... | [Model Deployment] |
| 1756 | Enabling AI PCs with Ryzen AI Software - AMD Community | [Model Deployment on Local] |
| 1758 | Introducing Amuse 2.0 Beta with AMD XDNA™ Super Resolution: a fully local, AI experience - AMD ... | [Model Deployment] |
| 1760 | Ryzen 7000 Pro with Ryzen AI: A Superior Hybrid Solution - AMD ... | [Model Deployment on Local] |
| 1764 | All New ONNX Model Zoo Powered by TurnkeyML - AMD Community | [Model Compression] [Model Deployment on Cloud] |
| 1809 | NVIDIA Brings New Production AI Capabilities to Microsoft Azure at Microsoft Ignite | [Model Deployment on Cloud] |
| 1824 | NVIDIA Triton Accelerates Inference on Oracle Cloud \| NVIDIA Blogs | [Model Serving and Scaling] [Model Compression] [Model Monitoring] |
| 1841 | NVIDIA Eos Revealed: Peek Into Operations of a Top 10 Supercomputer | [Model Serving and Scaling] |
| 1859 | New NVIDIA Storage Partner Validation Program Streamlines Enterprise AI Deployments | [Model Deployment on Cloud] |
| 1916 | NVIDIA Research Wins CVPR Autonomous Grand Challenge for End-to-End Driving | [Model Deployment on Local] |
| 1918 | 'Accelerate Everything,' NVIDIA CEO Says Ahead of COMPUTEX ... | [Model Serving and Scaling] |
| 1920 | New Performance Optimizations Supercharge NVIDIA RTX AI PCs for Gamers, Creators and Developers | [Model Serving and Scaling] |
| 1922 | NVIDIA Blackwell Platform Pushes the Boundaries of Scientific Computing | [Model Serving and Scaling] |
| 1923 | Gen AI Healthcare Accelerated: Dozens of Companies Adopt Meta Llama 3 NIM | [Model Deployment on Cloud] |
| 1967 | Maximizing Deep Learning Performance on NVIDIA Jetson Orin with DLA | [Model Deployment on Local] |
| 1972 | Customizing AI Models: Deploy a Character Detection and Recognition Model with NVIDIA Triton | [Model Deployment on Cloud] |
| 1974 | Scalable AI Sensor Streaming with Multi-GPU and Multi-Node Capabilities in NVIDIA Holoscan 0.6 | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 1986 | How to Build a Distributed Inference Cache with NVIDIA Triton and Redis | [Model Serving and Scaling] [Model Deployment on Cloud] [Model Monitoring] |
| 1993 | Speeding Up Text-To-Speech Diffusion Models by Distillation | [Model Compression] |
| 1995 | Deploying YOLOv5 on NVIDIA Jetson Orin with cuDLA: Quantization-Aware Training to Inference | [Model Compression] |
| 2047 | Unlock Faster Image Generation in Stable Diffusion Web UI with NVIDIA TensorRT | [Model Serving and Scaling] |
| 2140 | Fast-Track Computer Vision Deployments with NVIDIA DeepStream and Edge Impulse | [Model Deployment on Local] [Model Deployment on Cloud] |
| 2143 | Available Now: NVIDIA AI Accelerated DGL and PyG Containers for GNNs | [Model Serving and Scaling] |
| 2162 | Most Popular NVIDIA Technical Blog Posts of 2023: Generative AI, LLMs, Robotics, and Virtual Worlds Breakthroughs | [Model Serving and Scaling] |
| 2180 | Accelerating Inference on End-to-End Workflows with H2O.ai and NVIDIA | [Model Serving and Scaling] |
| 2181 | Develop ML and AI with Metaflow and Deploy with NVIDIA Triton Inference Server | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 2182 | New Stable Diffusion Models Accelerated with NVIDIA TensorRT | [Model Deployment on Cloud] |
| 2185 | Experience Real-Time Audio and Video Communication with NVIDIA Maxine | [Model Deployment on Cloud] |
| 2192 | Delivering Efficient, High-Performance AI Clouds with NVIDIA DOCA 2.5 | [Model Deployment on Cloud] |
| 2197 | Build Vision AI Applications at the Edge with NVIDIA Metropolis Microservices and APIs | [Model Deployment on Local] [Model Deployment on Cloud] |
| 2215 | Deploy an AI Coding Assistant with NVIDIA TensorRT-LLM and NVIDIA Triton | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 2228 | Benchmarking NVIDIA Spectrum-X for AI Network Performance, Now Available from Supermicro | [Model Monitoring] |
| 2229 | Performance-Efficient Mamba-Chat from NVIDIA AI Foundation Models | [Model Deployment on Cloud] |
| 2254 | NVIDIA TensorRT Accelerates Stable Diffusion Nearly 2x Faster with 8-bit Post-Training Quantization | [Model Compression] [Model Deployment on Cloud] |
| 2281 | Breaking Barriers in Healthcare with New Models for Generative AI and Cellular Imaging | [Model Deployment on Cloud] |
| 2285 | Powering Mission-Critical AI at the Edge with NVIDIA AI Enterprise IGX | [Model Monitoring] |
| 2289 | Speed Up Your AI Development: NVIDIA AI Workbench Goes GA | [Model Deployment on Cloud] |
| 2357 | Mistral Large and Mixtral 8x22B LLMs Now Powered by NVIDIA NIM and NVIDIA API | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 2386 | Regional LLMs SEA-LION and SeaLLM Serve Languages and Cultures of Southeast Asia | [Model Deployment on Cloud] |
| 2387 | NVIDIA TensorRT 10.0 Upgrades Usability, Performance, and AI Model Support | [Model Compression] [Model Serving and Scaling] [Model Deployment on Cloud] |
| 2398 | NVIDIA DeepStream 7.0 Milestone Release for Next-Gen Vision AI Development | [Model Deployment on Cloud] |
| 2439 | Supercharge Generative AI Development with Firebase Genkit, Optimized by NVIDIA RTX GPUs | [Model Deployment on Local] |
| 2442 | Accelerating Transformers with NVIDIA cuDNN 9 \| NVIDIA Technical ... | [Model Serving and Scaling] |
| 2449 | Enhancing the Apparel Shopping Experience with AI, Emoji-Aware OCR, and Snapchat’s Screenshop | [Model Serving and Scaling] [Model Compression] |
| 2451 | Build Lifelike Digital Human Technology with NVIDIA ACE, Now Generally Available | [Model Deployment on Cloud] [Model Deployment on Local] |
| 2452 | Maximum Performance and Minimum Footprint for AI Apps with NVIDIA TensorRT Weight-Stripped Engines | [Model Compression] [Model Deployment on Cloud] [Model Deployment on Local] [Model Serving and Scaling] |
| 2454 | Streamline Development of AI-Powered Apps with NVIDIA RTX AI Toolkit for Windows RTX PCs | [Model Deployment on Cloud] |
| 2457 | Building RAG Applications with NVIDIA NIM and Haystack on K8s | [Model Deployment on Cloud] [Model Monitoring] |
| 2463 | Power Cloud-Native Microservices at the Edge with NVIDIA JetPack 6.0, Now GA | [Model Deployment on Local] |
| 2473 | Introducing Grouped GEMM APIs in cuBLAS and More Performance Updates | [Model Serving and Scaling] |
| 2497 | MediaTek Integrates NVIDIA TAO ToolKit with NeuroPilot SDK for Accelerated Development of Edge AI Applications in IoT | [Model Deployment on Local] |
| 2504 | Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 2526 | Generate Traffic Insights Using YOLOv8 and NVIDIA JetPack 6.0 | [Model Deployment on Local] |
| 2579 | Power Your AI Projects with New NVIDIA NIMs for Mistral and Mixtral Models | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 2591 | Spotlight: HP 3D Printing Open Sources AI Surrogates for Additive Manufacturing Using NVIDIA Modulus | [Model Deployment on Cloud] |
| 2592 | Develop Production-Grade Text Retrieval Pipelines for RAG with NVIDIA NeMo Retriever | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 2622 | Accelerating Hebrew LLM Performance with NVIDIA TensorRT-LLM | [Model Serving and Scaling] |
| 2638 | Access to NVIDIA NIM Now Available Free to Developer Program Members | [Model Deployment on Cloud] |
| 2646 | Optimizing llama.cpp AI Inference with CUDA Graphs \| NVIDIA ... | [Model Serving and Scaling] |
| 2654 | Computed Tomography Organ and Disease Segmentation Using the NVIDIA VISTA-3D NIM Microservice | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 2655 | A Deep Dive into the Latest AI Models Optimized with NVIDIA NIM | [Model Deployment on Cloud] |
| 2660 | Empowering Energy Trading with MetDesk and NVIDIA Earth-2 | [Model Serving and Scaling] |
| 2707 | How Amazon Shopping uses Amazon Rekognition Content Moderation to review harmful images in product reviews | [Model Deployment on Cloud] |
| 2821 | Elevating the generative AI experience: Introducing streaming support in Amazon SageMaker hosting | [Model Deployment on Cloud] |
| 2836 | How Amazon's Search M5 team optimizes compute resources and ... | [Model Serving and Scaling] |
| 2873 | Deploy Generative AI Models on Amazon EKS \| Containers | [Model Deployment on Cloud] |
| 2874 | Maximizing GPU utilization with NVIDIA's Multi-Instance GPU (MIG ... | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 2947 | Ray Integration for AWS Trainium and AWS Inferentia is Now Available | [Model Serving and Scaling] |
| 2951 | Future-proof Your AI at the Edge with AWS \| AWS for Industries | [Model Deployment on Local] |
| 2954 | Train and deploy ML models in a multicloud environment using Amazon SageMaker | [Model Deployment on Cloud] |
| 2999 | Innovation for Inclusion: Hack.The.Bias with Amazon SageMaker | [Model Deployment on Cloud] |
| 3011 | Philips Prototypes a Large-scale, Near-real-time Inference Platform to Extend Medical Imaging Using AWS | [Model Deployment on Cloud] |
| 3033 | Create a Generative AI Gateway to allow secure and compliant consumption of foundation models | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 3086 | Create an HCLS document summarization application with Falcon using Amazon SageMaker JumpStart | [Model Deployment on Cloud] |
| 3132 | New – No-code generative AI capabilities now available in Amazon SageMaker Canvas | [Model Deployment on Cloud] |
| 3133 | Improve performance of Falcon models with Amazon SageMaker | [Model Serving and Scaling] |
| 3139 | Automated Cloud-to-Edge Deployment of Industrial AI Models with Siemens Industrial Edge | [Model Deployment on Local] |
| 3207 | How Veriff decreased deployment time by 80% using Amazon SageMaker multi-model endpoints | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 3268 | Intuitivo achieves higher throughput while saving on AI/ML costs using AWS Inferentia and PyTorch | [Model Deployment on Cloud] |
| 3298 | Deploy and fine-tune foundation models in Amazon SageMaker JumpStart with two lines of code | [Model Deployment on Cloud] |
| 3306 | Deploying Level 4 Digital Twin Self-Calibrating Virtual Sensors on AWS | [Model Deployment on Cloud] |
| 3400 | Build a medical imaging AI inference pipeline with MONAI Deploy on AWS | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 3419 | Amazon Bedrock now provides access to Meta's Llama 2 Chat 13B ... | [Model Deployment on Cloud] |
| 3547 | How Snorkel AI achieved over 40% cost savings by scaling machine learning workloads using Amazon EKS | [Model Serving and Scaling] [Model Deployment on Cloud] [Model Monitoring] |
| 3548 | Text embedding and sentence similarity retrieval at scale with Amazon SageMaker JumpStart | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 3551 | How Amazon Music uses SageMaker with NVIDIA to optimize ML training and inference performance and cost | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 3582 | Optimizing costs for Amazon SageMaker Canvas with automatic shutdown of idle apps | [Model Monitoring] |
| 3598 | Boost inference performance for LLMs with new Amazon SageMaker containers | [Model Compression] |
| 3624 | OEMs accelerate automated feature development with new Amazon EC2 DL2q instances, powered by the Qualcomm Cloud AI 100 | [Model Deployment on Cloud] |
| 3669 | Introducing Amazon SageMaker HyperPod to train foundation models at scale | [Model Serving and Scaling] [Model Monitoring] |
| 3670 | Package and deploy classical ML and LLMs easily with Amazon SageMaker, part 2: Interactive User Experiences in SageMaker Studio | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 3673 | Reduce model deployment costs by 50% on average using the latest features of Amazon SageMaker | [Model Serving and Scaling] |
| 3678 | Minimize real-time inference latency by using Amazon SageMaker routing strategies | [Model Serving and Scaling] [Model Deployment on Cloud] [Model Monitoring] |
| 3702 | Enable faster training with Amazon SageMaker data parallel library | [Model Serving and Scaling] |
| 3798 | Llama Guard is now available in Amazon SageMaker JumpStart | [Model Deployment on Cloud] |
| 3824 | Mixtral-8x7B is now available in Amazon SageMaker JumpStart | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 3825 | Amazon SageMaker model parallel library now accelerates PyTorch FSDP workloads by up to 20% | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 3836 | Automating Quality Machine Inspection Infused with Edge AI and Digital Twins for Device Monitoring | [Model Deployment on Local] [Model Serving and Scaling] [Model Monitoring] |
| 3850 | How to become a generative AI builder, starting at square one \| AWS ... | [Model Deployment on Cloud] |
| 3876 | Build an Amazon SageMaker Model Registry approval and promotion workflow with human intervention | [Model Monitoring] |
| 3878 | Deploy a Slack gateway for Amazon Q Business \| AWS Machine ... | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 3895 | AWS AI Backend Developed by Avahi Enables WittGen Biotechnology to Help Fight Cancer | [Model Deployment on Cloud] |
| 3928 | Host the Whisper Model on Amazon SageMaker: exploring inference options | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 3955 | How anti-fraud systems use explainable AI to protect the betting and gaming industry | [Model Deployment on Cloud] |
| 4207 | Streamline diarization using AI as an assistive technology: ZOO Digital’s story | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 4217 | Run ML inference on unplanned and spiky traffic using Amazon SageMaker multi-model endpoints | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 4263 | Generative AI-Powered Clinical Intelligence: Safely Driving Better Outcomes | [Model Deployment on Cloud] |
| 4356 | Getting Started with Generative AI Using Hugging Face Platform on AWS | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 4392 | Federated learning on AWS using FedML, Amazon EKS, and Amazon SageMaker | [Model Deployment on Cloud] |
| 4418 | Powering the generative AI era: What you missed at the AWS Public Sector Symposium Brussels | [Model Deployment on Cloud] |
| 4518 | Scale LLMs with PyTorch 2.0 FSDP on Amazon EKS – Part 2 \| AWS ... | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 4530 | Tackle complex reasoning tasks with Mistral Large, now available on Amazon Bedrock | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 4531 | Creating a User Activity Dashboard for Amazon CodeWhisperer | [Model Monitoring] |
| 4559 | Quora achieved 3x lower latency and 25% lower costs by modernizing model serving with Nvidia Triton on Amazon EKS | [Model Serving and Scaling] [Model Compression] |
| 4568 | Nielsen Sports sees 75% cost reduction in video analysis with Amazon SageMaker multi-model endpoints | [Model Serving and Scaling] |
| 4577 | Boost inference performance for Mixtral and Llama 2 models with new Amazon SageMaker containers | [Model Deployment on Cloud] [Model Serving and Scaling] [Model Compression] |
| 4622 | Distributed training and efficient scaling with the Amazon SageMaker Model Parallel and Data Parallel Libraries | [Model Serving and Scaling] |
| 4660 | Use Kubernetes Operators for new inference capabilities in Amazon SageMaker that reduce LLM deployment costs by 50% on average | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 4667 | Scale AI training and inference for drug discovery through Amazon EKS and Karpenter | [Model Deployment on Cloud] |
| 4695 | Integrate HyperPod clusters with Active Directory for seamless multi-user login | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 4721 | Databricks DBRX is now available in Amazon SageMaker JumpStart | [Model Deployment on Cloud] |
| 4732 | Deploy a Hugging Face (PyAnnote) speaker diarization model on Amazon SageMaker as an asynchronous endpoint | [Model Deployment on Cloud] |
| 4737 | Run scalable, enterprise-grade generative AI workloads with Cohere Command R & R+, now available in Amazon Bedrock | [Model Deployment on Cloud] |
| 4751 | Cohere Command R and R+ are now available in Amazon SageMaker JumpStart | [Model Deployment on Cloud] |
| 4763 | Intelligent rig operations classification with HITL on AWS \| AWS for ... | [Model Deployment on Cloud] |
| 4781 | Accelerate drug discovery with NVIDIA BioNeMo Framework on Amazon EKS | [Model Deployment on Cloud] |
| 4782 | Amazon Personalize launches new recipes supporting larger item catalogs with lower latency | [Model Deployment on Cloud] |
| 4783 | AWS Inferentia and AWS Trainium deliver lowest cost to deploy Llama 3 models in Amazon SageMaker JumpStart | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 4803 | Deploy LLMs in AWS GovCloud (US) Regions using Hugging Face Inference Containers | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 4867 | Accelerate NLP inference with ONNX Runtime on AWS Graviton processors | [Model Serving and Scaling] |
| 4937 | Optimized for low-latency workloads, Mistral Small now available in Amazon Bedrock | [Model Deployment on Cloud] |
| 4942 | Accelerate Mixtral 8x7B pre-training with expert parallelism on Amazon SageMaker | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 4962 | Large scale training with NVIDIA NeMo Megatron on AWS ParallelCluster using P5 instances | [Model Deployment on Cloud] |
| 5004 | Falcon 2 11B is now available on Amazon SageMaker JumpStart | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 5074 | Get started quickly with AWS Trainium and AWS Inferentia using AWS Neuron DLAMI and AWS Neuron DLC | [Model Deployment on Cloud] |
| 5082 | Sprinklr improves performance by 20% and reduces cost by 25% for machine learning inference on AWS Graviton3 | [Model Deployment on Cloud] [Model Serving and Scaling] [Model Monitoring] |
| 5170 | Maximize your Amazon Translate architecture using strategic caching layers | [Model Serving and Scaling] |
| 5172 | Manage Amazon SageMaker JumpStart foundation model access with private hubs | [Model Deployment on Cloud] |
| 5187 | Improve visibility into Amazon Bedrock usage and performance with Amazon CloudWatch | [Model Monitoring] |
| 5192 | Scale and simplify ML workload monitoring on Amazon EKS with AWS Neuron Monitor container | [Model Monitoring] [Model Serving and Scaling] |
| 5219 | Build generative AI applications on Amazon Bedrock — the secure, compliant, and responsible foundation | [Model Monitoring] |
| 5259 | Accelerated PyTorch inference with torch.compile on AWS Graviton processors | [Model Deployment on Cloud] |
| 5308 | Achieve up to ~2x higher throughput while reducing costs by up to ~50% for generative AI inference on Amazon SageMaker with the new inference optimization toolkit – Part 2 | [Model Compression] [Model Serving and Scaling] |
| 5439 | Llama 3.1 models are now available in Amazon SageMaker JumpStart | [Model Deployment on Cloud] |
| 5461 | Deploying generative AI applications with NVIDIA NIMs on Amazon EKS | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 5463 | Amazon SageMaker inference launches faster auto scaling for generative AI models | [Model Serving and Scaling] [Model Monitoring] |
| 5469 | Boosting Salesforce Einstein's code generating model performance ... | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 5506 | Node problem detection and recovery for AWS Neuron nodes within Amazon EKS clusters | [Model Monitoring] |
| 5560 | Intuit uses Amazon Bedrock and Anthropic's Claude to explain taxes ... | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 5613 | Faster LLMs with speculative decoding and AWS Inferentia2 \| AWS ... | [Model Serving and Scaling] |
| 5666 | How Cisco accelerated the use of generative AI with Amazon SageMaker Inference | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 5674 | Cisco achieves 50% latency improvement using Amazon SageMaker Inference faster autoscaling feature | [Model Serving and Scaling] |
| 5714 | Neural network pruning with combinatorial optimization | [Model Compression] |
| 5733 | Touch and see Google Cloud infrastructure in the Hardware-verse ... | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 5740 | Google Distributed Cloud: new AI and data services \| Google Cloud ... | [Model Deployment on Cloud] [Model Deployment on Local] |
| 5792 | Performance deep dive of Gemma on Google Cloud \| Google Cloud ... | [Model Deployment on Cloud] |
| 5794 | Google Cloud's container platform for the next decade of AI \| Google ... | [Model Deployment on Cloud] |
| 5899 | IBM Watson and ESPN use AI to transform fantasy football data | [Model Deployment on Cloud] |
| 6028 | Speed, scale and trustworthy AI on IBM Z with Machine Learning for ... | [Model Serving and Scaling] |
| 6070 | Introducing Azure NC H100 v5 VMs for mid-range AI and HPC workloads | [Model Deployment on Cloud] |
| 6112 | Annual Roundup on AI Infrastructure Breakthroughs for 2023 | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 6165 | Discover the Power of SAP AI Core: The New Learning Journey Now Available! | [Model Deployment on Cloud] |
| 6166 | SAP AI Core - Realtime inference with SAP HANA Machine Learning - SAP ... | [Model Serving and Scaling] |
| 6175 | SAP AI Core - Scheduling SAP HANA Machine Learning - SAP ... | [Model Deployment on Cloud] |
| 6222 | AI in SAP BTP: Q3 2023 Highlights – SAP AI Business Services, SAP AI Core and SAP AI Launchpad - SAP ... | [Model Serving and Scaling] |
| 6339 | It's Christmas! Ollama+Phi-2 on SAP AI Core - SAP Community | [Model Serving and Scaling] |
| 6519 | Deployment of Seamless M4T v2 models on SAP AI Core - SAP ... | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 6531 | Leveraging SAP AI Core APIs to Build your own AI Powered Apps - SAP ... | [Model Deployment on Cloud] |
| 6532 | A Comprehensive Overview of Intelligent Scenario Lifecycle Management (ISLM) | [Model Serving and Scaling] |
| 6533 | Unlock innovation and transformation with expanded SAP BTP and SAP AI services on Microsoft Azure - SAP ... | [Model Deployment on Cloud] |
| 6534 | SAP AI Core Static Deployment URL - SAP Community | [Model Deployment on Cloud] [Model Monitoring] [Model Serving and Scaling] |
| 6546 | CI/CD with SAP AI Core - SAP Community | [Model Serving and Scaling] |
| 6557 | SAP AI Core is All You Need \| 7. Deploying Language Models for Text Generation - SAP ... | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 6624 | Mistral-7B in OCI Data Science: An overview and deployment guide | [Model Deployment on Cloud] |
| 6659 | Simplify your model monitoring and MLOps with OML Model Monitoring UI | [Model Monitoring] |
| 6666 | Accelerating telco innovation by leveraging power of GPUs on Oracle Cloud Infrastructure for enhanced customer experiences and operational efficiency | [Model Serving and Scaling] [Model Deployment on Cloud] |
| 6671 | Driving Government Innovation: Oracle Cloud Infrastructure Supercluster Leverages NVIDIA AI in Oracle US Government Cloud | [Model Deployment on Cloud] |
| 6691 | Deploy Llama 3.1 405B in OCI Data Science | [Model Deployment on Cloud] |
| 6695 | New to OCI AI Infrastructure: Midrange Bare Metal Compute with NVIDIA L40S and VMs with NVIDIA H100/A100 | [Model Deployment on Cloud] |
| 6804 | Hyperforce: The Trust, Innovation, and Customer Success Enabler | [Model Deployment on Cloud] |
| 7027 | Unleashing Creativity Exploring the Power of Generative AI on Cloud | [Model Deployment on Cloud] |
| 7028 | Quickly Deploy Open Source LLMs in EAS - Alibaba Cloud Community | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 7029 | Deploy a RAG-Based LLM Chatbot in EAS - Alibaba Cloud Community | [Model Serving and Scaling] |
| 7032 | Accelerating Large Language Model Inference: High-performance TensorRT-LLM | [Model Compression] |
| 7036 | Alibaba Cloud Launches Tongyi Qianwen 2.0 and Industry-specific Models to | [Model Deployment] |
| 7038 | Alibaba Cloud Unveils Serverless Solution to Harness Gen-AI Capabilities for | [Model Deployment on Cloud] |
| 7042 | Best Practices for Large Model Inference in ACK: TensorRT-LLM | [Model Deployment on Cloud] |
| 7057 | Tongyi Bailian - Model Studio with Chinese Version of Alibaba Cloud | [Model Deployment on Cloud] |
| 7059 | AI Container Image Deployment: Stable Diffusion - Alibaba Cloud ... | [Model Deployment on Cloud] |
| 7065 | Rapid Deployment of AI Painting with WebUI on PAI-EAS using Alibaba Cloud | [Model Deployment on Cloud] |
| 7067 | Quick Start the AI Model on the Alibaba Cloud Model Studio | [Model Deployment on Cloud] |
| 7078 | AI Container Image Deployment: Qwen-Audio-Chat - Alibaba Cloud ... | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 7080 | AI Container Image Deployment: Qwen-VL-Chat - Alibaba Cloud ... | [Model Deployment on Cloud] [Model Serving and Scaling] |
| 7099 | TePDist (an HLO-Based Fully Automatic Distributed System) Has Opened Its Source | [Model Serving and Scaling] [Model Compression] |
| 7100 | Quickly Deploy Stable Diffusion for Text-to-Image Generation in EAS | [Model Deployment on Cloud] |
| 7101 | Deploying Pre-trained Models on Alibaba Cloud ECS Using Hugging Face | [Model Deployment on Cloud] |
| 7108 | DeepRec: A Training and Inference Engine for Sparse Models in Large-Scale | [Model Serving and Scaling] [Model Compression] [Model Deployment on Cloud] |