SE4FM.md

Table of Contents

Data Management

id post tags
1 What Is Retrieval-Augmented Generation aka RAG | NVIDIA Blogs [RAG]
35 NVIDIA Releases Open Synthetic Data Generation Pipeline for Training Large Language Models [Dataset Collection]
57 Generating Science: NVIDIA AI Accelerates HPC Research | NVIDIA ... [RAG]
99 New NVIDIA NeMo Retriever Microservices Boost LLM Accuracy and Throughput [RAG]
137 Scaling Enterprise RAG with Accelerated Ethernet Networking and Networked Storage [RAG]
138 Build Enterprise Retrieval-Augmented Generation Apps with NVIDIA Retrieval QA Embedding Model [Specialized Databases] [RAG]
154 Leverage the Latest Open Models for Synthetic Data Generation with NVIDIA Nemotron-4 340B [Dataset Collection] [Dataset Cleaning and Preparation]
169 Simplify Custom Generative AI Development with NVIDIA NeMo Microservices [Dataset Cleaning and Preparation]
188 Curating Non-English Datasets for LLM Training with NVIDIA NeMo Curator [Dataset Cleaning and Preparation] [Dataset Collection]
195 RAG 101: Retrieval-Augmented Generation Questions Answered [RAG]
309 Scenario Diffusion helps Zoox vehicles navigate safety-critical situations [Dataset Collection] [Dataset Cleaning and Preparation]
341 AWS Pi Day 2024: Use your data to power generative AI | AWS ... [RAG] [Dataset Cleaning and Preparation] [Dataset Collection]
344 Generative AI for Semiconductor Design and Verification | AWS for ... [RAG]
348 How Audi improved their chat experience with Generative AI on Amazon SageMaker [RAG]
361 Learn how Amazon Pharmacy created their LLM-based chat-bot using Amazon SageMaker [RAG]
372 The Fanatics generative AI hackathon | AWS for Industries [RAG]
378 Private network for data movement in generative AI | Networking ... [RAG]
386 Harnessing the power of enterprise data with generative AI: Insights from Amazon Kendra, LangChain, and large language models [RAG]
401 Efficient continual pre-training LLMs for financial domains | AWS ... [Dataset Collection] [Dataset Cleaning and Preparation]
496 Advancements in machine learning for machine learning [Dataset Collection]
515 Improving Gboard language models via private federated analytics [Dataset Cleaning and Preparation]
633 New GenAI Databases Retrieval App helps improves LLM answers ... [RAG]
637 Writer.com wins generative AI success with Google Cloud databases [Specialized Databases]
639 How Spanner vector search supports generative AI apps | Google ... [Specialized Databases]
641 Build User Authentication into your GenAI App Accessing Database ... [RAG]
653 genAI and google cloud ML to get actionable insight | Google Cloud ... [Feature Engineering] [Dataset Collection] [Dataset Cleaning and Preparation]
680 Perform product analysis with generative AI and BigQuery | Google ... [Specialized Databases] [Dataset Collection] [Dataset Cleaning and Preparation]
682 Memorystore for Redis vector search and LangChain integrations for gen AI [RAG]
744 IBM watsonx AI and data platform, security solutions and consulting ... [Dataset Cleaning and Preparation]
779 IBM watsonx Assistant transforms content into conversational ... [RAG]
804 The importance of data ingestion and integration for enterprise AI ... [Dataset Cleaning and Preparation] [Dataset Collection] [RAG]
828 The recipe for RAG: How cloud services enable generative AI ... [RAG]
888 Optimize Azure OpenAI Applications with Semantic Caching [Specialized Databases]
1058 GUEST POST - EmbedElite Meets Semantic Kernel: A Game ... [RAG] [Specialized Databases]
1089 Azure Cosmos DB Conf 2024: Accelerating Innovation in AI and Data [RAG]
1090 Decoding AI: Part 7, Retrieval Augmented Generation with GAI [RAG]
1122 Azure OpenAI On Your Data with Semantic Kernel | Semantic Kernel [Dataset Cleaning and Preparation]
1155 GraphRAG: New tool for complex data discovery now on GitHub [RAG]
1156 GraphRAG: Unlocking LLM discovery on narrative private data [RAG]
1161 Unified Database: Laying the foundation for large language model vertical applications [Specialized Databases] [RAG]
1219 Intelligent monitoring: Towards AI-assisted monitoring for cloud services [Feature Engineering]
1245 Lessons learned from building LLM applications - SAP Community [RAG]
1263 RAG with SAP HANA Cloud Vector Engine, GenAI Hub & CAP [RAG] [Specialized Databases]
1269 Harnessing Generative AI Capabilities with SAP HANA Cloud Vector Engine - Part 1 [Architecture] - SAP ... [Specialized Databases]
1272 GenAI Integration with Events-to-Business Framework for Intelligent Action Synthesis - SAP ... [RAG]
1286 Enhance data privacy in SAP CAP based GenAI application with ... [Dataset Cleaning and Preparation]
1294 Share corporate info with an LLM using Embeddings - SAP ... [Specialized Databases]
1297 A Journey into Retrieval-Augmented Generation (RAG) on SAP BTP [RAG]
1298 Early Adopter Care Program SAP HANA Cloud Vector Engine [RAG]
1301 Generative AI: Some thoughts on using Embeddings - SAP Community [Specialized Databases] [Dataset Cleaning and Preparation]
1308 Enhancing S/4HANA with SAP HANA Cloud Vector Store and GenAI [Specialized Databases]
1309 How Data Anonymization in SAP HANA secure Generative AI applications through SAP Datasphere [Dataset Cleaning and Preparation]
1314 Vectorize your Data : SAP HANA Cloud's Vector Engine for Unified Data Excellence - SAP ... [RAG]
1320 SAP Inside Track Bangalore 2024: Generative AI Extravaganza - SAP ... [RAG]
1328 First steps using the Hana Vector Engine with SAP GEN AI [RAG]
1334 Which Embedding Model should I use with my Corporate LLM? [Specialized Databases]
1341 HANA Vector Engine and LangChain - SAP Community [Specialized Databases]
1348 Analysts react to MySQL HeatWave Gen AI and vector store innovations [Specialized Databases] [Dataset Cleaning and Preparation]
1364 From inference to RAG: Choosing CPUs for efficient generative AI application deployments [RAG]
1370 Unlock the Power of Oracle AI: Near Real-Time data to Feed Your RAG [Feature Engineering]
1372 Generative AI in HeatWave: Introduction [Specialized Databases]
1382 Behind the Scenes: Using OCI Generative AI Agents to improve contextual accuracy [RAG]
1384 Generative AI Chatbot using LLaMA-2, Qdrant, RAG, LangChain & Streamlit [RAG] [Specialized Databases]
1395 GenAI RAG Likes Explicit Relationships: Use Graphs! [RAG]
1398 Leading Industry Analysts Comment on the Release of Oracle Database 23ai [Specialized Databases] [RAG]
1399 OCI Search with OpenSearch 2.11 delivers easy access to latest AI innovations [Specialized Databases]
1426 Similarity Search in Oracle Autonomous Database 19c [Specialized Databases]
1436 Embedded intelligence: Storing and retrieving embeddings in a feature store [Specialized Databases]
1445 Securing the LLM Stack - Cisco Blogs [RAG]
1565 What is prompt grounding? — A generative AI tutorial | Salesforce [RAG]
1571 Vector Databases, Built for the AI Era, Make Your AI Better – Here's ... [Specialized Databases]
1759 How to enable RAG (Retrieval Augmented Generation) on an AMD Ryzen™ AI PC or Radeon Graphics Card - AMD ... [RAG]
1797 Industrial Designer Blends Art and OpenUSD to Create 3D Assets for AI Training [Dataset Collection] [Dataset Labeling and Annotation]
1862 Rack 'n' Roll: NVIDIA Grace Hopper Systems Gather at GTC ... [RAG]
1897 Unlocking the Future of Manufacturing With OpenUSD on Siemens Teamcenter X [Dataset Collection]
1982 Pro Tips for Building Multilingual Recommender Systems | NVIDIA ... [Feature Engineering] [Dataset Collection]
2013 Accelerating Vector Search: Using GPU-Powered Indexes with RAPIDS RAFT [Specialized Databases]
2018 Accelerating Vector Search: Fine-Tuning GPU Index Algorithms [Specialized Databases]
2055 Boost Synthetic Data Generation with Low-Code Workflows in NVIDIA Omniverse Replicator 1.10 [Dataset Collection] [Dataset Cleaning and Preparation]
2072 How to Train Autonomous Mobile Robots to Detect Warehouse Pallet Jacks Using Synthetic Data [Dataset Collection] [Dataset Cleaning and Preparation]
2163 RAG 101: Demystifying Retrieval-Augmented Generation Pipelines [RAG]
2165 Teaching AVs the Language of Human Driving Behavior with Trajeglish [Dataset Collection]
2247 Video: Build a RAG-Powered Chatbot in Five Minutes | NVIDIA ... [RAG]
2284 An Easy Introduction to Multimodal Retrieval-Augmented Generation [RAG]
2326 Scale and Curate High-Quality Datasets for LLM Training with NVIDIA NeMo Curator [Dataset Cleaning and Preparation] [Dataset Collection] [Feature Engineering]
2349 Explainer: What Is Retrieval-Augmented Generation? | NVIDIA ... [RAG]
2468 How to Train an Object Detection Model for Visual Inspection with Synthetic Data [Dataset Collection] [Dataset Cleaning and Preparation]
2531 Optimize AI Model Performance and Maintain Data Privacy with Hybrid RAG [RAG]
2577 Transforming Telco Network Operations Centers with NVIDIA NeMo Retriever and NVIDIA NIM [Dataset Collection] [RAG]
2581 Accelerate AI Infrastructure Using an NVIDIA BlueField-3 DPU Integration with DDN Storage [Dataset Collection] [Dataset Cleaning and Preparation]
2587 Creating Synthetic Data Using Llama 3.1 405B | NVIDIA Technical ... [Dataset Collection] [RAG]
2599 Automating Telco Network Design using NVIDIA NIM and NVIDIA NeMo [RAG]
2617 How to Build a Generative AI-Enabled Synthetic Data Pipeline with OpenUSD [Dataset Collection]
2624 Curating Custom Datasets for LLM Parameter-Efficient Fine-Tuning with NVIDIA NeMo Curator [Dataset Cleaning and Preparation] [Dataset Collection]
2634 Deliver Personalized Retail Experiences with an AI-Powered Shopping Advisor [RAG]
2662 Developing Robust Georgian Automatic Speech Recognition with FastConformer Hybrid Transducer CTC BPE [Dataset Cleaning and Preparation] [Dataset Collection]
2664 Spotlight: NVIDIA BlueField DPUs Power the VAST Data Platform for AI Workload Optimization [Data Management]
2693 Build production-ready generative AI applications for enterprise search using Haystack pipelines and Amazon SageMaker JumpStart with LLMs [RAG]
2899 Simplify access to internal information using Retrieval Augmented Generation and LangChain Agents [RAG]
2965 Maximizing the value of OT data with DeepIQ DataStudio (Data + AI) on AWS [Dataset Cleaning and Preparation] [Dataset Collection]
2979 Unlock ML insights using the Amazon SageMaker Feature Store Feature Processor [Dataset Cleaning and Preparation] [Feature Engineering] [Dataset Collection]
3085 Using AWS generative AI to improve defect detection in Manufacturing [Dataset Collection]
3130 Prepare your data for Amazon Personalize with Amazon SageMaker Data Wrangler [Dataset Cleaning and Preparation] [Dataset Collection]
3138 Index your web crawled content using the new Web Crawler for Amazon Kendra [Dataset Collection]
3213 Automatically redact PII for machine learning using Amazon SageMaker Data Wrangler [Dataset Cleaning and Preparation]
3263 Empower your business users to extract insights from company documents using Amazon SageMaker Canvas and Generative AI [RAG]
3398 Using Fleet Training to Improve Level 3 Digital Twin Virtual Sensors with Ansys on AWS [Dataset Collection]
3420 Amazon Bedrock now provides access to Cohere Command Light and Cohere Embed English and multilingual models [RAG]
3423 Amazon Kinesis Data Streams: celebrating a decade of real-time data innovation [Dataset Collection]
3544 Build scalable and serverless RAG workflows with a vector engine for Amazon OpenSearch Serverless and Amazon Bedrock Claude models [RAG]
3606 Analyze large amounts of graph data to get insights and find trends with Amazon Neptune Analytics [Specialized Databases]
3643 AWS Clean Rooms ML helps customers and partners apply ML models without sharing raw data (preview) [RAG]
3649 Akridata accelerates processing of unstructured data with Amazon S3 Express One Zone [Dataset Labeling and Annotation]
3659 lakeFS and Amazon S3 Express One Zone: Highly performant data version control for ML/AI [Dataset Collection] [Dataset Cleaning and Preparation]
3698 Foundational data protection for enterprise LLM acceleration with Protopia AI [RAG]
3751 Amazon Titan Embeddings for enhanced content recommendations to power 1:1 personalization [Specialized Databases]
3767 Enhance price capture in energy and commodity trading with AWS machine learning [Dataset Collection] [Dataset Cleaning and Preparation]
3896 Amazon OpenSearch Service search enhancements: 2023 roundup [Specialized Databases]
3984 Build enterprise-ready generative AI solutions with Cohere foundation models in Amazon Bedrock and Weaviate vector database on AWS Marketplace [RAG]
4022 Getting started with Amazon Titan Text Embeddings in Amazon Bedrock [RAG]
4053 Build generative AI applications with Amazon Aurora and Amazon Bedrock Knowledge Bases [RAG]
4055 Analyze security findings faster with no-code data preparation using generative AI and Amazon SageMaker Canvas [Dataset Cleaning and Preparation]
4075 How HSR.health is limiting risks of disease spillover from animals to humans using Amazon SageMaker geospatial capabilities [Dataset Collection]
4090 Improve the performance of generative AI workloads on Amazon Aurora with Optimized Reads and pgvector [Specialized Databases]
4190 Amazon SageMaker Feature Store now supports cross-account sharing, discovery, and access [Specialized Databases]
4196 Build a contextual chatbot application using Knowledge Bases for ... [RAG]
4210 How to Use Amazon SageMaker Pipelines MLOps with Gretel Synthetic Data [Dataset Collection] [Dataset Cleaning and Preparation]
4266 Unlock supply chain value with data and AI | Amazon Supply Chain ... [Data Management]
4270 Use RAG for drug discovery with Knowledge Bases for Amazon Bedrock [RAG]
4367 Build a RAG data ingestion pipeline for large-scale ML workloads [RAG] [Dataset Collection] [Dataset Cleaning and Preparation]
4421 Reimagining Vector Databases for the Generative AI Era with Pinecone Serverless on AWS [Specialized Databases]
4439 On-screen computation time using machine learning tools from AWS [Dataset Collection] [Dataset Cleaning and Preparation]
4565 Accelerate Data Modernization and AI with IBM Databases on AWS [Specialized Databases]
4574 Knowledge Bases for Amazon Bedrock now supports metadata ... [RAG]
4595 Cost-effective document classification using the Amazon Titan Multimodal Embeddings Model [Specialized Databases]
4723 Knowledge Bases in Amazon Bedrock now simplifies asking ... [RAG]
4747 Amazon Titan Text Embeddings V2 now available in Amazon Bedrock, optimized for improving RAG [Specialized Databases]
4805 Unleashing the power of generative AI: Verisk's journey to an Instant ... [RAG]
4935 How LeadSquared accelerated chatbot deployments with generative AI using Amazon Bedrock and Amazon Aurora PostgreSQL [RAG]
4968 Vitech uses Amazon Bedrock to revolutionize information access with AI-powered chatbot [RAG]
4984 Enhance image search experiences with Amazon Personalize, Amazon OpenSearch Service, and Amazon Titan Multimodal Embeddings in Amazon Bedrock [Specialized Databases]
5035 Implement serverless semantic search of image and live video with Amazon Titan Multimodal Embeddings [Specialized Databases] [Dataset Collection]
5052 Build RAG applications using Jina Embeddings v2 on Amazon SageMaker JumpStart [RAG]
5055 Siemens builds Datalake2Go on AWS to analyze disparate data globally [Dataset Collection]
5181 How Krikey AI harnessed the power of Amazon SageMaker Ground Truth to accelerate generative AI development [Dataset Labeling and Annotation]
5242 Improve productivity when processing scanned PDFs using Amazon Q Business [Dataset Collection]
5370 Key considerations when choosing a database for your generative AI applications [Specialized Databases] [RAG]
5511 Improve the productivity of your customer support and project management teams using Amazon Q Business and Atlassian Jira [RAG]
5676 How Deltek uses Amazon Bedrock for question and answering on government solicitation documents [RAG]
5725 SOAR: New algorithms for even faster vector search with ScaNN [Specialized Databases]
5784 Using Filestore as an accelerator for AI/ML workloads on GKE ... [RAG]
5815 BigQuery multimodal embeddings and embedding generation ... [Specialized Databases] [Feature Engineering]
5822 Introducing new PyTorch Dataflux Dataset abstraction | Google ... [Dataset Collection] [Dataset Cleaning and Preparation]
5830 VectorStore in the Cloud SQL for PostgreSQL LangChain package ... [Specialized Databases]
5845 Spanner now supports Approximate Nearest Neighbor (ANN ... [Specialized Databases]
5907 IBM watsonx Assistant: Driving generative AI innovation with ... [RAG]
5934 Synthetic data generation: Building trust by ensuring privacy and ... [Dataset Cleaning and Preparation]
6093 On Your Data Generally Available in Azure OpenAI Service [RAG]
6099 AI Chat App Hack: Watch all the streams! - Microsoft Community Hub [RAG]
6159 "Search the web" for up-to-date OpenAI chat responses - Surface ... [RAG]
6190 Setup an AI Vector Database using the PostgreSQL on SAP BTP, Hyperscaler Option - SAP ... [Specialized Databases]
6278 Recap SAP HANA Cloud @ SAP TechEd - SAP Community [Specialized Databases]
6324 Hana Data Lake Files as Object Store in SAP AI Core [Dataset Collection] [Dataset Cleaning and Preparation]
6331 A Guide to Advanced RAG Techniques for Success in Business Landscape [RAG]
6332 Predict, Personalize, Prosper: Crafting Tomorrow's Retail Experience with RAG - Part 1/3 - SAP Community [RAG]
6382 From Developer's Desk: SAP HANA Cloud Vector Engine - SAP ... [Specialized Databases] [Dataset Cleaning and Preparation]
6461 What's New in SAP HANA Cloud – March 2024 [Specialized Databases]
6477 SAP HANA Cloud Vector Engine: Quick FAQ Reference - SAP ... [RAG]
6492 Python RAG sample for beginners using SAP HANA Cloud and SAP AI Core - SAP ... [RAG]
6494 Embedding Business Context with the SAP HANA Cloud, Vector Engine - SAP ... [RAG]
6561 Vector Data Visualization and Comparision between ... - SAP ... [Specialized Databases] [Dataset Cleaning and Preparation]
6631 Oracle CloudWorld 2023: Summary of Notable Announcements [RAG]
6686 Effortlessly Build AI-Powered Q/A Apps Using HeatWave GenAI [RAG]
6688 Revolutionizing Healthcare with AI: Building an Advanced Chatbot Using Mixtral, Oracle 23AI, RAG, LangChain, and Streamlit [RAG]
6696 Implement Semantic Search in Oracle APEX using AI Vector Search of Oracle Database 23ai [Specialized Databases]
7021 Quickly Building a RAG Service on Compute Nest with LLM on PAI-EAS and [RAG]
7030 Exploring DevOps in the Era of AI Foundation Models Part Ⅱ: Data Warehouse [Dataset Collection] [Dataset Cleaning and Preparation]
7044 Building a Retrieval-Augmented Generation (RAG) Service on Compute Nest with [RAG]
7051 Alibaba Cloud Open Sources Toolkits for Video Generation Model Development [Dataset Cleaning and Preparation]
7053 Build RAG Applications with Spring Cloud Alibaba AI - Alibaba ... [RAG]
7068 One-Click Fitting: Online Retrieval of AnalyticDB Vector for Taobao AI Fitting Room [Specialized Databases]
7094 Alibaba Group's Practice of Accelerating Large Model Training Based on Fluid [Specialized Databases] [RAG]

Evaluation and Quality Assurance

id post tags
28 Acing the Test: NVIDIA Turbocharges Generative AI Training in MLPerf Benchmarks [Model Evaluation]
102 Best Practices for Securing LLM-Enabled Applications | NVIDIA ... [Model Safety and Compliance]
152 Measuring Generative AI Model Performance Using NVIDIA GenAI-Perf and an OpenAI-Compatible API [Model Evaluation]
158 Streamline Evaluation of LLMs for Accuracy with NVIDIA NeMo Evaluator [Model Evaluation]
196 Secure LLM Tokenizers to Maintain Application Integrity | NVIDIA ... [Model Safety and Compliance]
210 Using Chakra execution traces for benchmarking and network ... [Model Evaluation]
384 AWS Audit Manager extends generative AI best practices framework to Amazon SageMaker [Model Safety and Compliance]
437 Responsible AI at Google Research: Adversarial testing for generative AI safety [Testing Strategies] [Model Fairness and Bias] [Model Evaluation] [Model Risk and Trust] [Model Safety and Compliance]
482 WeatherBench 2: A benchmark for the next generation of data-driven weather models [Model Evaluation]
499 Adversarial Nibbler Challenge: Continuous open red-teaming with diverse communities [Model Safety and Compliance]
519 Introducing Gemini: Google's most capable AI model yet [Model Evaluation]
522 How Google is expanding its commitment to secure AI [Model Safety and Compliance]
697 Infographic: To securely build AI on Google Cloud, follow these best ... [Model Risk and Trust]
738 Building AI for business: IBM's Granite foundation models [Model Risk and Trust]
807 How to use foundation models and trusted governance to manage ... [Model Risk and Trust]
832 What is red teaming for generative AI? - IBM Research [Model Safety and Compliance]
845 DARPA and IBM are ensuring that anyone can protect their AI systems from hackers [Model Safety and Compliance]
898 SLM and LLM Evaluation on Custom Data using Prompt Flow [Model Evaluation]
921 Evaluating RAG Applications with AzureML Model Evaluation [Model Evaluation]
945 HiddenLayer Model Scanner helps developers assess the security of open models in the model catalog [Model Safety and Compliance]
1064 Beyond the hype: Part 1, How trustworthy AI empowers US Government agencies [Model Risk and Trust]
1070 Unit Testing with Semantic Kernel | Semantic Kernel [Testing Strategies]
1153 Phi-2: The surprising power of small language models - Microsoft ... [Model Evaluation]
1188 Research Focus: Week of June 10, 2024 - Microsoft Research [Model Evaluation]
1215 Microsoft at VL/HCC 2023: Focus on co-audit tools for spreadsheets [Testing Strategies]
1252 Boosting Benchmarking for Reliable Business AI - SAP Community [Model Evaluation] [Testing Strategies]
1295 Unlocking the Potential of Business AI: Engineering Best Practices - SAP Community [Model Fairness and Bias]
1305 Using Ragas with AI core + other metrics to evaluate LLMs [Model Evaluation]
2032 Preventing Health Data Leaks with Federated Learning Using NVIDIA FLARE [Model Safety and Compliance]
2242 Evaluating Retriever for Enterprise-Grade RAG | NVIDIA Technical ... [Model Evaluation]
2533 Addressing Hallucinations in Speech Synthesis LLMs with the NVIDIA NeMo T5-TTS Model [Model Evaluation]
2623 Securing Generative AI Deployments with NVIDIA NIM and NVIDIA NeMo Guardrails [Model Safety and Compliance]
2680 Build and Deploy Secure AI Applications with AIShield and Amazon SageMaker [Model Safety and Compliance]
3212 Securing generative AI: An introduction to the Generative AI Security Scoping Matrix [Model Risk and Trust] [Model Safety and Compliance]
4521 Gradient makes LLM benchmarking cost-effective and effortless with AWS Inferentia [Model Evaluation]
4739 How Arcanum AI Migrated Models from OpenAI to AWS Using Amazon Bedrock and Amazon SageMaker JumpStart [Model Evaluation]
4776 How Patronus AI helps enterprises boost their confidence in generative AI [Model Explainability and Interpretability]
4914 Optimize AI governance with Amazon SageMaker and IBM watsonx.governance [Model Safety and Compliance]
5474 Evaluate conversational AI agents with Amazon Bedrock | AWS ... [Testing Strategies]
6568 Certification for Partner AI Apps on SAP BTP – Ensuring Reliability, Responsibility, and Relevance - SAP Community [Model Safety and Compliance]

Model Customization

id post tags
36 NVIDIA Fast-Tracks Custom Generative AI Model Development for Enterprises [Model Fine-Tuning]
68 NVIDIA and Amdocs Bring Custom Generative AI to Telco Industry ... [General Fine-Tuning]
74 NVIDIA Collaborates With Genentech to Accelerate Drug Discovery Using Generative AI [General Fine-Tuning]
91 NVIDIA NeMo SteerLM Customizes a Model's Responses During ... [General Fine-Tuning]
97 MLPerf Training Results Showcase Unprecedented Performance and Elasticity [LoRA] [General Fine-Tuning]
105 Generative AI and Accelerated Computing for Spear Phishing Detection [General Fine-Tuning]
106 NVIDIA Sets New Generative AI Performance and Scale Records in MLPerf Training v4.0 [General Fine-Tuning]
109 Fine-Tune and Align LLMs Easily with NVIDIA NeMo Customizer [LoRA] [RLHF]
117 Amdocs Accelerates Generative AI Performance and Lowers Costs with NVIDIA NIM [LoRA]
124 Deploy GPU-Optimized AI Software with One Click Using Brev.dev and NVIDIA NGC Catalog [General Fine-Tuning]
127 Streamline Generative AI Development with NVIDIA NeMo on GPU-Accelerated Google Cloud [General Fine-Tuning] [RLHF] [LoRA]
128 Customize Generative AI Models for Enterprise Applications with Llama 3.1 [LoRA]
135 Better 3D Meshes, from Reconstruction to Generative AI | NVIDIA ... [General Fine-Tuning]
142 New NVIDIA NeMo Framework Features and NVIDIA H200 Supercharge LLM Training Performance and Versatility [General Fine-Tuning]
159 Build Custom Enterprise-Grade Generative AI with NVIDIA AI Foundation Models [General Fine-Tuning]
175 NVIDIA NeMo Accelerates LLM Innovation with Hybrid State Space Model Support [General Fine-Tuning]
192 Power Text-Generation Applications with Mistral NeMo 12B Running on a Single GPU [LoRA]
235 Adobe Partners with NVIDIA to Harness the Power of PDF Intelligence with Next-Gen LLMs [General Fine-Tuning]
353 Best Practices from Quantiphi for Unleashing Generative AI Functionality by Fine-Tuning LLMs [General Fine-Tuning] [LoRA]
427 Cappy: Outperforming and boosting large multi-task language models with a small scorer [General Fine-Tuning]
430 ScreenAI: A visual language model for UI and visually-situated language understanding [General Fine-Tuning]
433 USER-LLM: Efficient LLM contextualization with user embeddings [General Fine-Tuning]
435 Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes [General Fine-Tuning]
443 CodecLM: Aligning language models with tailored synthetic data [General Fine-Tuning]
449 Protecting users with differentially private synthetic training data [LoRA] [General Fine-Tuning]
452 Spoken question answering and speech continuation using a spectrogram-powered LLM [General Fine-Tuning]
456 Language to rewards for robotic skill synthesis [RLHF]
492 Unsupervised speech-to-speech translation from monolingual data [General Fine-Tuning]
504 Grammar checking at Google Search scale [General Fine-Tuning]
506 MediaPipe FaceStylizer: On-device real-time few-shot face stylization [General Fine-Tuning]
606 Tune Gemini Pro in Google AI Studio or with the Gemini API [LoRA]
649 BigQuery can now fine-tune models hosted in Vertex AI | Google ... [LoRA] [General Fine-Tuning]
656 Google is a Leader in the 2024 Gartner® Magic Quadrant™ for Data Science and Machine Learning Platforms [RLHF]
733 Generative AI that's tailored for your business needs with watsonx.ai ... [General Fine-Tuning]
834 A new way to collaboratively customize LLMs - IBM Research [General Fine-Tuning]
1159 Research at Microsoft 2023: A year of groundbreaking AI advances and discoveries [General Fine-Tuning]
1169 GigaPath: Whole-Slide Foundation Model for Digital Pathology [General Fine-Tuning]
1177 LoftQ: Reimagining LLM fine-tuning with smarter initialization [General Fine-Tuning]
1189 Lifelong model editing in large language models: Balancing low-cost targeted edits and catastrophic forgetting [General Fine-Tuning]
1202 Learning from interaction with Microsoft Copilot (web) - Microsoft ... [RLHF]
1266 AI Foundation on SAP BTP: Q1 2024 Release Highlights [General Fine-Tuning]
1351 Maximizing LLM training and inference efficiency using CentML on OCI [General Fine-Tuning]
1352 Powering the AI revolution: Oracle at NVIDIA GTC [General Fine-Tuning]
1366 Using open source LLMs on OCI with dstack [General Fine-Tuning]
1379 First Principles: Exploring the depths of OCI Generative AI Service [General Fine-Tuning]
1425 Valence Labs uses OCI to help build largest GNN in drug discovery [General Fine-Tuning]
1668 BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models [LoRA]
1762 Unleashing Next-Gen AI & HPC Performance with the ... - AMD ... [General Fine-Tuning]
1790 NVIDIA and AMD Deliver Powerful Workstations to Accelerate AI, Rendering and Simulation [General Fine-Tuning]
1844 Speak Like a Native: NVIDIA Parlays Win in Voice Challenge [General Fine-Tuning]
1973 Customizing AI Models: Train Character Detection and Recognition Models with NVIDIA TAO [General Fine-Tuning]
2115 Train Generative AI Models for Drug Discovery with NVIDIA BioNeMo Framework [General Fine-Tuning]
2116 Transforming Industrial Defect Detection with NVIDIA TAO and Vision AI Models [General Fine-Tuning]
2150 Develop and Optimize Vision AI Models for Trillions of Devices with NVIDIA TAO [General Fine-Tuning] [RLHF]
2184 Enhancing Phone Customer Service with ASR Customization [General Fine-Tuning]
2195 Robust Scene Text Detection and Recognition: Implementation [General Fine-Tuning]
2217 Create, Share, and Scale Enterprise AI Workflows with NVIDIA AI Workbench, Now in Beta [LoRA]
2222 Emulating the Attention Mechanism in Transformer Models with a Fully Convolutional Network [General Fine-Tuning]
2240 Scalable Federated Learning with NVIDIA FLARE for Enhanced LLM Performance [General Fine-Tuning]
2255 Optimizing OpenFold Training for Drug Discovery | NVIDIA ... [General Fine-Tuning]
2341 New Standard for Speech Recognition and Translation from the NVIDIA NeMo Canary Model [General Fine-Tuning]
2351 Pushing the Boundaries of Speech Recognition with NVIDIA NeMo Parakeet ASR Models [General Fine-Tuning]
2359 Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks [General Fine-Tuning]
2367 Enhance Text-to-Image Fine-Tuning with DRaFT+, Now Part of NVIDIA NeMo [RLHF] [General Fine-Tuning]
2390 Visual Language Models on NVIDIA Hardware with VILA | NVIDIA ... [General Fine-Tuning]
2401 Customizing Neural Machine Translation Models with NVIDIA NeMo, Part 1 [General Fine-Tuning] [LoRA]
2409 Customizing Neural Machine Translation Models with NVIDIA NeMo, Part 2 [General Fine-Tuning]
2435 Training Localized Multilingual LLMs with NVIDIA NeMo, Part 2 [General Fine-Tuning]
2462 Seamlessly Deploying a Swarm of LoRA Adapters with NVIDIA NIM [LoRA]
2532 Customizing NVIDIA NIM for Domain-Specific Needs with NVIDIA NeMo [LoRA]
2540 Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning [LoRA] [General Fine-Tuning]
2544 Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data [General Fine-Tuning]
2567 Spotlight: Siemens Energy Accelerates Power Grid Asset Simulation 10,000x Using NVIDIA Modulus [General Fine-Tuning]
2631 Fast-Track Robot Learning in Simulation Using NVIDIA Isaac Lab [General Fine-Tuning] [RLHF]
2869 Optimize equipment performance with historical data, Ray, and Amazon SageMaker [General Fine-Tuning]
2981 Improving your LLMs with RLHF on Amazon SageMaker | AWS ... [RLHF]
3202 Personalize your search results with Amazon Personalize and Amazon OpenSearch Service integration [General Fine-Tuning]
3507 Fine-tune Whisper models on Amazon SageMaker with LoRA | AWS ... [LoRA]
3540 KT's journey to reduce training time for a vision transformers model ... [General Fine-Tuning]
3541 How Amazon Search M5 saved 30% for LLM training cost by using AWS Trainium [General Fine-Tuning]
3542 BriBooks improves children's creative writing with generative AI ... [General Fine-Tuning]
3710 How Getir reduced model training durations by 90% with Amazon SageMaker and AWS Batch [General Fine-Tuning]
3750 Fine-tune Llama 2 using QLoRA and Deploy it on Amazon SageMaker with AWS Inferentia2 [LoRA]
4023 Train Llama2 with AWS Trainium on Amazon EKS | Containers [General Fine-Tuning]
4272 Accelerating large-scale neural network training on CPUs with ThirdAI and AWS Graviton [General Fine-Tuning]
4522 Unlocking the value of unstructured data: How Coactive built a visual analytics platform on AWS [General Fine-Tuning]
4583 AWS Weekly Roundup: Amazon EC2 G6 instances, Mistral Large on Amazon Bedrock, AWS Deadline Cloud, and more (April 8, 2024) [General Fine-Tuning]
4740 Develop and train large models cost-efficiently with Metaflow and AWS Trainium [General Fine-Tuning]
4777 Fine-tune and deploy language models with Amazon SageMaker Canvas and Amazon Bedrock [General Fine-Tuning]
4802 Boosted.ai's generative AI portfolio manager surfaces near-instant ... [General Fine-Tuning]
4840 Transform customer engagement with no-code LLM fine-tuning using Amazon SageMaker Canvas and SageMaker JumpStart [General Fine-Tuning]
4917 Efficient and cost-effective multi-tenant LoRA serving with Amazon SageMaker [LoRA]
5001 Pre-training genomic language models using AWS HealthOmics and Amazon SageMaker [General Fine-Tuning]
5027 Streamline custom model creation and deployment for Amazon Bedrock with Provisioned Throughput using Terraform [General Fine-Tuning]
5142 Fine-tuning an LLM using QLoRA in AWS GovCloud (US) | AWS ... [LoRA]
5224 The future of productivity agents with NinjaTech AI and AWS Trainium [General Fine-Tuning]
5303 Choice: Keeping pace with emerging models for generative AI in Life Sciences [General Fine-Tuning]
5307 How BRIA AI used distributed training in Amazon SageMaker to train latent diffusion foundation models for commercial use [General Fine-Tuning]
5369 How Mixbook used generative AI to offer personalized photo book experiences [General Fine-Tuning]
5431 How to expansively train Robot Learning by Customers on AWS using functions generated by Large Language Models [RLHF]
5442 Use Llama 3.1 405B for synthetic data generation and distillation to fine-tune smaller models [General Fine-Tuning]
7031 GenAI Model Optimization: Guide to Fine-Tuning and Quantization [General Fine-Tuning]
7033 E2E Development and Usage of LLM Data Processing + Model Training + Model [General Fine-Tuning]
7096 EasyCV | Out-of-the-Box Visual Self-Supervision + Transformer Algorithm Library [General Fine-Tuning]

Model Deployment and Operation

id post tags
3 NVIDIA and Scaleway Speed Development for European Startups and Enterprises [Model Deployment on Cloud]
5 How Amazon and NVIDIA Help Sellers Create Better Product Listings With AI [Model Serving and Scaling] [Model Deployment on Cloud]
9 Ray Shines with NVIDIA AI: Anyscale Collaboration to Help ... [Model Serving and Scaling]
10 At Your Microservice: NVIDIA Smooths Businesses' Journey to ... [Model Deployment on Cloud]
11 LLMs Land on Laptops: NVIDIA, HP CEOs Celebrate AI PCs [Model Deployment on Local]
15 NVIDIA Expands Robotics Platform to Meet the Rise of Generative AI [Model Deployment on Local]
19 Google's Gemma Optimized Across All NVIDIA AI Platforms | NVIDIA ... [Model Deployment on Cloud] [Model Deployment on Local]
22 NVIDIA Grace Hopper Superchip Sweeps MLPerf Inference Benchmarks [Model Serving and Scaling]
29 New Class of Accelerated, Efficient AI Systems Mark the Next Era of Supercomputing [Model Serving and Scaling]
31 NVIDIA BioNeMo Enables Generative AI for Drug Discovery on AWS [Model Deployment on Cloud]
37 NVIDIA Advances Accelerated Computing, Generative AI at AWS re:Invent [Model Serving and Scaling] [Model Deployment on Cloud]
40 KServe Providers Offering NIM Inference in Clouds and Data ... [Model Serving and Scaling] [Model Deployment on Cloud]
41 NVIDIA and Google Cloud Collaborate to Accelerate AI Development [Model Deployment on Cloud]
43 TOPS of the Class: Decoding AI Performance on RTX AI PCs and Workstations [Model Serving and Scaling]
45 Mistral AI and NVIDIA Unveil Mistral NeMo 12B, a Cutting-Edge Enterprise AI Model [Model Serving and Scaling] [Model Deployment on Cloud]
55 Decoding NIM Microservices That Accelerate Generative AI | NVIDIA ... [Model Serving and Scaling] [Model Deployment on Cloud] [Model Deployment on Local]
73 Singtel, NVIDIA to Bring Sovereign AI to Southeast Asia | NVIDIA Blogs [Model Deployment]
75 How Developers Can Construct the Future of Generative AI at Microsoft Build 2024 [Model Deployment on Cloud]
77 NVIDIA AI Microservices for Drug Discovery, Digital Health Now Integrated With AWS [Model Deployment on Cloud]
78 NVIDIA Collaborates With Microsoft to Help Developers Build ... [Model Deployment on Cloud]
83 NVIDIA Teams With Google DeepMind to Drive LLM Innovation ... [Model Deployment on Cloud]
89 Unlocking AI for Enterprises: Join NVIDIA at Oracle CloudWorld [Model Deployment on Cloud]
92 Large Language Models up to 4x Faster on RTX With TensorRT-LLM for Windows [Model Serving and Scaling]
93 NVIDIA and Alphabet's Intrinsic Put Next-Gen Robotics Within Grasp ... [Model Deployment on Local]
98 Generative AI's Journey to Production Unveiled at Google Cloud ... [Model Deployment on Cloud] [Model Serving and Scaling]
100 NVIDIA TensorRT-LLM Supercharges Large Language Model Inference on NVIDIA H100 GPUs [Model Serving and Scaling] [Model Deployment on Cloud]
107 Deploying Retrieval-Augmented Generation Applications on NVIDIA GH200 Delivers Accelerated Performance [Model Deployment on Cloud]
111 NVIDIA NIM Offers Optimized Inference Microservices for Deploying AI Models at Scale [Model Serving and Scaling] [Model Deployment on Cloud]
113 Supercharging LLM Applications on Windows PCs with NVIDIA RTX Systems [Model Deployment on Local]
114 Personalized Learning with Gipi, NVIDIA TensorRT-LLM, and AI Foundation Models [Model Serving and Scaling]
115 Power Your Business with NVIDIA AI Enterprise 4.0 for Production-Ready Generative AI [Model Serving and Scaling] [Model Deployment on Cloud]
116 Achieving High Mixtral 8x7B Performance with NVIDIA H100 Tensor Core GPUs and NVIDIA TensorRT-LLM [Model Serving and Scaling]
118 Demystifying AI Inference Deployments for Trillion Parameter Large Language Models [Model Serving and Scaling]
119 Achieving Top Inference Performance with the NVIDIA H100 Tensor Core GPU and NVIDIA TensorRT-LLM [Model Serving and Scaling]
120 How to Take a RAG Application from Pilot to Production in Four Steps [Model Deployment on Cloud]
122 Build Enterprise-Grade AI with NVIDIA AI Software | NVIDIA ... [Model Deployment on Cloud] [Model Monitoring]
123 NVIDIA H200 Tensor Core GPUs and NVIDIA TensorRT-LLM Set MLPerf LLM Inference Records [Model Serving and Scaling] [Model Compression] [Model Deployment on Cloud]
125 Get Started with Generative AI Development for Windows PCs with NVIDIA RTX [Model Compression]
129 Leading MLPerf Inference v3.1 Results with NVIDIA GH200 Grace Hopper Superchip Debut [Model Deployment on Cloud]
130 NVIDIA GB200 NVL72 Delivers Trillion-Parameter LLM Training and Real-Time Inference [Model Serving and Scaling] [Model Deployment on Cloud]
131 Production-Ready, Enterprise-Grade Software on NVIDIA IGX Platform, Support for NVIDIA RTX 6000 ADA, and More [Model Deployment on Local] [Model Deployment]
134 Optimizing Inference on Large Language Models with NVIDIA TensorRT-LLM, Now Publicly Available [Model Serving and Scaling] [Model Deployment on Cloud]
139 Deploy Large Language Models at the Edge with NVIDIA IGX Orin Developer Kit [Model Deployment on Local] [Model Compression]
141 Writer Releases Domain-Specific LLMs for Healthcare and Finance [Model Deployment on Cloud]
143 Advancing Security for Large Language Models with NVIDIA GPUs and Edgeless Systems [Model Deployment on Cloud]
144 NVIDIA H100 System for HPC and Generative AI Sets Record for Financial Risk Calculations [Model Serving and Scaling] [Model Deployment on Cloud]
148 NVIDIA TensorRT-LLM Enhancements Deliver Massive Large Language Model Speedups on NVIDIA H200 [Model Serving and Scaling]
149 NVIDIA Collaborates with Hugging Face to Simplify Generative AI Model Deployments [Model Deployment on Cloud] [Model Serving and Scaling]
151 Advancing Production AI with NVIDIA AI Enterprise | NVIDIA ... [Model Monitoring]
153 Accelerate Generative AI Inference Performance with NVIDIA TensorRT Model Optimizer, Now Publicly Available [Model Compression] [Model Serving and Scaling]
160 Turbocharging Meta Llama 3 Performance with NVIDIA TensorRT-LLM and NVIDIA Triton Inference Server [Model Deployment on Cloud]
164 Elevate Enterprise Generative AI App Development with NVIDIA AI on Azure Machine Learning [Model Deployment on Cloud] [Model Serving and Scaling]
165 Join the First NVIDIA LLM Developer Day: Elevate Your App-Building Skills [Model Deployment on Cloud]
168 NVIDIA AI Foundation Models: Build Custom Enterprise Chatbots and Co-Pilots with Production-Ready LLMs [Model Deployment on Cloud]
173 Bringing Generative AI to Life with NVIDIA Jetson | NVIDIA Technical ... [Model Serving and Scaling]
181 NVIDIA TensorRT-LLM Revs Up Inference for Google Gemma [Model Serving and Scaling]
184 A Simple Guide to Deploying Generative AI with NVIDIA NIM [Model Deployment on Cloud]
186 One Giant Superchip for LLMs, Recommenders, and GNNs: Introducing NVIDIA GH200 NVL32 [Model Deployment on Cloud]
187 Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities [Model Serving and Scaling]
194 Bringing Generative AI to the Edge with NVIDIA Metropolis Microservices for Jetson [Model Deployment on Local]
203 Building Meta's GenAI Infrastructure - Engineering at Meta [Model Deployment on Cloud]
204 How Meta trains large language models at scale - Engineering at Meta [Model Deployment on Cloud]
206 How Meta is creating custom silicon for AI - Engineering at Meta [Model Serving and Scaling] [Model Deployment on Cloud]
207 Maintaining large-scale AI capacity at Meta - Engineering at Meta [Model Serving and Scaling] [Model Monitoring]
216 Taming the tail utilization of ads inference at Meta scale ... [Model Deployment on Cloud]
306 More-efficient recovery from failures during large-ML-model training [Model Serving and Scaling]
325 Accelerating the next wave of generative AI startups | AWS Startups ... [Model Deployment on Cloud]
330 Unlocking Innovation: AWS and Anthropic push the boundaries of generative AI together [Model Deployment on Cloud]
334 Why purpose-built artificial intelligence chips may be key to your generative AI strategy [Model Serving and Scaling] [Model Deployment on Cloud]
351 A secure approach to generative AI with AWS | AWS Machine ... [Model Deployment on Cloud]
352 Build an internal SaaS service with cost and usage tracking for foundation models on Amazon Bedrock [Model Serving and Scaling] [Model Monitoring]
357 AWS Healthcare Customers Announce New Generative AI-Powered Solutions at HIMSS 2024 [Model Deployment on Cloud]
362 Designing generative AI workloads for resilience | AWS Machine ... [Model Serving and Scaling]
382 Optimize price-performance of LLM inference on NVIDIA GPUs using the Amazon SageMaker integration with NVIDIA NIM Microservices [Model Serving and Scaling] [Model Deployment on Cloud]
396 eSentire delivers private and secure generative AI interactions to customers with Amazon SageMaker [Model Deployment on Cloud]
406 Improve Amazon Bedrock Observability with Amazon CloudWatch AppSignals [Model Monitoring]
444 Mixed-input matrix multiplication performance optimizations [Model Serving and Scaling]
467 Advances in private training for production on-device language models [Model Deployment on Local]
468 Computer-aided diagnosis for lung cancer screening [Model Deployment on Cloud]
476 MobileDiffusion: Rapid text-to-image generation on-device [Model Deployment on Local]
521 Google Cloud Next 2024: Gemini and generative AI updates [Model Deployment on Cloud]
532 Google: Gemini API, Imagen 2, Duet AI and more updates [Model Deployment on Cloud]
533 Google I/O 2024: Sundar Pichai on Gemini, AI progress and more [Model Deployment on Cloud]
539 5 highlights from Google Cloud Next 2024 [Model Deployment on Cloud]
603 AI Edge Torch Generative API for Custom LLMs on Device - Google ... [Model Deployment on Local]
605 AI Edge Torch: High Performance Inference of PyTorch Models on Mobile Devices [Model Deployment on Local]
607 Model Explorer: Simplifying ML models for Edge devices - Google ... [Model Monitoring]
619 the world's largest distributed LLM training job on TPU v5e | Google ... [Model Serving and Scaling]
622 Accelerating AI Inference with Google Cloud TPUs and GPUs ... [Model Serving and Scaling] [Model Deployment on Cloud]
624 Unlock AI anywhere with Google Distributed Cloud | Google Cloud ... [Model Deployment on Local] [Model Serving and Scaling] [Model Deployment on Cloud]
627 How Cloud TPU v5e accelerates large-scale AI inference | Google ... [Model Serving and Scaling] [Model Deployment on Cloud]
638 What's new with Google Cloud's AI Hypercomputer architecture ... [Model Deployment on Cloud]
650 Performance per dollar of GPUs and TPUs for AI inference | Google ... [Model Serving and Scaling]
655 Introducing Cloud TPU v5p and AI Hypercomputer | Google Cloud ... [Model Deployment on Cloud]
659 Google in The Forrester Wave AI Infrastructure Solutions, Q1 2024 ... [Model Serving and Scaling] [Model Deployment on Cloud]
660 RAG quickstart with Ray, LangChain, and HuggingFace | Google ... [Model Deployment on Cloud]
669 New localllm lets you develop gen AI apps locally, without GPUs ... [Model Deployment on Cloud] [Model Monitoring] [Model Serving and Scaling]
701 Cost-efficient AI inference with Cloud TPU v5e on GKE | Google ... [Model Deployment on Cloud]
704 How Google Cloud is bringing Gemini to organizations everywhere ... [Model Deployment on Cloud]
708 The overwhelmed person's guide to Google Cloud | Google Cloud ... [Model Deployment]
721 IBM Contributions at PyTorch Conference 2023 - IBM Developer [Model Deployment on Cloud]
837 What is AI inferencing? - IBM Research [Model Serving and Scaling] [Model Compression]
841 Why larger LLM context windows are all the rage - IBM Research [Model Deployment on Cloud]
843 The future of AI is open - IBM Research [Model Serving and Scaling]
849 New analog AI chip design uses much less power for AI tasks - IBM ... [Model Compression]
858 Semantic Kernel: Local LLMs Unleashed on Raspberry Pi 5 [Model Deployment on Local]
861 Introducing NVIDIA Nemotron-3 8B LLMs on the Model Catalog [Model Deployment on Cloud]
862 SemanticKernel – Chat Service demo running Llama2 LLM locally in ... [Model Deployment on Local]
863 Fundamental of Deploying Large Language Model Inference [Model Serving and Scaling]
865 Build, benchmark, evaluate and deploy real-time inference endpoint with Prompt Flow [Model Deployment on Cloud]
869 Path to Production Azure OpenAI Instances - Education [Model Monitoring] [Model Serving and Scaling]
879 Welcoming Mistral, Phi, Jais, Code Llama, NVIDIA Nemotron, and more to the Azure AI Model Catalog [Model Deployment on Cloud]
880 Microsoft and Hugging Face deepen generative AI partnership [Model Deployment on Cloud]
882 The LLM Latency Guidebook: Optimizing Response Times for GenAI Applications [Model Serving and Scaling]
899 Enabling satellite operators to offer AI at the edge in space [Model Deployment on Local]
914 Unlocking the power of NPU on Surface: Our “Hello World” journey [Model Deployment on Local] [Model Compression]
915 Learn how to power your AI transformation with the Microsoft Cloud at NVIDIA GTC. [Model Deployment on Cloud] [Model Serving and Scaling]
916 Optimizing Azure OpenAI: A Guide to Limits, Quotas, and Best Practices [Model Serving and Scaling] [Model Monitoring]
919 Microsoft at Supercomputing 2023 [Model Serving and Scaling] [Model Deployment on Cloud]
920 Strategies for Optimizing High-Volume Token Usage with Azure OpenAI [Model Serving and Scaling] [Model Monitoring]
927 Azure OpenAI Service Launches GPT-4 Turbo and GPT-3.5-Turbo-1106 Models [Model Deployment on Cloud]
928 Deploy your Azure Machine Learning prompt flow on virtually any platform [Model Deployment on Cloud]
932 What runs GPT-4o and Microsoft Copilot? | Largest AI supercomputer in the cloud | Mark Russinovich [Model Serving and Scaling] [Model Deployment on Cloud]
952 Microsoft showcases latest AI solutions at NVIDIA GTC [Model Deployment on Cloud]
981 Microsoft and G42 partner to accelerate AI innovation in UAE and beyond [Model Deployment on Cloud]
992 Startups to access high-performance Azure infrastructure, accelerating AI breakthroughs [Model Deployment on Cloud]
1063 Delivering Cutting-Edge AI Solutions to US Government - Azure ... [Model Deployment on Cloud]
1099 Terminal Chat in Windows Terminal Canary - Windows Command ... [Model Deployment on Local]
1106 Image to Text with Semantic Kernel and HuggingFace | Semantic ... [Model Deployment on Cloud] [Model Serving and Scaling]
1154 LLM profiling guides KV cache optimization - Microsoft Research [Model Compression]
1160 Microsoft at ASPLOS 2024: Advancing hardware and software for high-scale, secure, and efficient modern applications [Model Deployment on Cloud]
1170 Splitwise improves GPU usage by splitting LLM inference phases [Model Serving and Scaling] [Model Deployment on Cloud]
1179 Skeleton-of-Thought: Parallel decoding speeds up and improves LLM output [Model Serving and Scaling]
1181 Research Focus: Week of April 15, 2024 - Microsoft Research [Model Serving and Scaling]
1200 Research Focus: Week of September 25, 2023 - Microsoft Research [Model Serving and Scaling]
1224 Efficient and hardware-friendly neural architecture search with SpaceEvo [Model Compression]
1267 Now available: starter kit for genAI on SAP BTP - SAP Community [Model Deployment on Cloud]
1300 Secure your LLM: Consuming SAP Generative AI deployments in a Simple Python App - SAP ... [Model Deployment on Cloud] [Model Serving and Scaling]
1346 Early LLM serving experience and performance results with AMD Instinct MI300X GPUs [Model Deployment on Cloud]
1355 Democratizing Generative AI with CPU-based Inference [Model Compression] [Model Serving and Scaling] [Model Monitoring]
1362 Deploy LangChain applications as OCI model deployments [Model Deployment on Cloud]
1363 Deploy Falcon-7B with NVIDIA TensorRT-LLM on OCI [Model Deployment on Cloud]
1365 Bridging cloud and conversational AI: LangChain and OCI Data Science platform [Model Deployment on Cloud]
1373 Exadata System Software 24ai - Delivers mission critical AI at any scale [Model Serving and Scaling]
1374 Serving LLM using HuggingFace and Kubernetes on OCI - Part II [Model Deployment on Cloud]
1375 Serving LLMs using HuggingFace and Kubernetes on OCI [Model Deployment on Cloud]
1377 The Future of Generative AI: What Enterprises Need to Know [Model Deployment on Cloud]
1380 Bring your own model to OCI Data Science AI Quick Actions [Model Deployment on Cloud]
1391 Deploying ELYZA with vLLM and OCI Data Science [Model Deployment on Cloud] [Model Serving and Scaling]
1394 OCI with NVIDIA A100 Tensor Core GPUs for HPC and AI sets risk calculation records in financial services [Model Serving and Scaling]
1397 Ampere Computing and Wallaroo.AI expand advanced AI options to OCI [Model Deployment on Cloud] [Model Serving and Scaling]
1402 How to Run NVIDIA NeMo on Oracle Cloud Infrastructure [Model Deployment on Cloud]
1403 Practical inferencing of open source models on mainstream GPU-accelerated OCI servers [Model Deployment on Cloud] [Model Compression] [Model Serving and Scaling]
1413 Speeding into the future: How SQream and Oracle catalyze rapid AI innovation [Model Deployment on Cloud]
1414 John Snow Labs chooses OCI to deploy its AI medical chatbot [Model Deployment on Cloud]
1419 AI and the Enterprise: Oracle's New Capabilities for Driving ... [Model Deployment on Cloud]
1420 MLPerf Training Benchmark 4.0 Results on OCI GPU Superclusters [Model Serving and Scaling]
1423 Enhancing OCI Data Science: Unveiling the New Autoscaling Feature for Model Deployment [Model Deployment on Cloud]
1437 Machine learning enhanced real time fraud detection on OCI with NVIDIA Triton Inference Server [Model Deployment on Cloud] [Model Serving and Scaling]
1448 Building Data Center Infrastructure for the AI Revolution - Cisco Blogs [Model Deployment on Cloud]
1471 Operational Innovations for AI and Cloud-Native Workloads from Cisco and Red Hat [Model Serving and Scaling] [Model Deployment on Cloud]
1473 An In-Depth Look at the Cisco CCDE-AI Infrastructure Certification [Model Deployment on Cloud]
1556 Train Your Own LLM or Use an Existing One? | Salesforce [Model Deployment on Cloud]
1730 Power-efficient acceleration for large language models – Qualcomm Cloud AI SDK [Model Deployment on Cloud]
1731 Train anywhere, Infer on Qualcomm Cloud AI 100 [Model Serving and Scaling]
1734 AI workloads with Windows on Snapdragon [Model Deployment on Local]
1735 Bare-metal, Hardware-Accelerated AI for Windows Apps Using ONNX RT [Model Deployment on Cloud]
1736 Give your Hybrid AI the edge with Windows on Snapdragon [Model Deployment on Local] [Model Serving and Scaling]
1737 How to Quadruple LLM Decoding Performance with Speculative Decoding (SpD) and Microscaling (MX) Formats on Qualcomm® Cloud AI 100 [Model Serving and Scaling]
1740 Microsoft Build 2024 – Unleashing the potential of AI with Windows on Snapdragon [Model Serving and Scaling]
1742 How to run a Large Language Model (LLM) on your AMD Ryzen™ AI PC or Radeon Graphics Card - AMD ... [Model Deployment on Local] [Model Serving and Scaling]
1743 Supercharge Your LLMs with AMD Instinct™ MI300X Accelerators and ROCm™ Software - AMD ... [Model Serving and Scaling] [Model Deployment on Cloud]
1744 Reduce Memory Footprint and Improve Performance Running LLMs on AMD Ryzen™ AI and Radeon™ Platforms [Model Compression]
1745 How Infinigence Provides Fast Generative AI Acceleration Solutions on AMD GPUs - AMD ... [Model Compression] [Model Serving and Scaling]
1749 Llama 3.1: Ready to Run on AMD platforms from data center, edge to AI PCs - AMD ... [Model Deployment on Cloud] [Model Serving and Scaling]
1750 Developer Blog: Build a Chatbot with Ryzen™ AI Processors [Model Compression] [Model Deployment on Local]
1754 New AMD ROCm™ 6.1 Software for Radeon™ Release Offers More Choices to AI Developers - AMD ... [Model Deployment]
1756 Enabling AI PCs with Ryzen AI Software - AMD Community [Model Deployment on Local]
1758 Introducing Amuse 2.0 Beta with AMD XDNA™ Super Resolution: a fully local, AI experience - AMD ... [Model Deployment]
1760 Ryzen 7000 Pro with Ryzen AI: A Superior Hybrid Solution - AMD ... [Model Deployment on Local]
1764 All New ONNX Model Zoo Powered by TurnkeyML - AMD Community [Model Compression] [Model Deployment on Cloud]
1809 NVIDIA Brings New Production AI Capabilities to Microsoft Azure at Microsoft Ignite [Model Deployment on Cloud]
1824 NVIDIA Triton Accelerates Inference on Oracle Cloud | NVIDIA Blogs [Model Serving and Scaling] [Model Compression] [Model Monitoring]
1841 NVIDIA Eos Revealed: Peek Into Operations of a Top 10 Supercomputer [Model Serving and Scaling]
1859 New NVIDIA Storage Partner Validation Program Streamlines Enterprise AI Deployments [Model Deployment on Cloud]
1916 NVIDIA Research Wins CVPR Autonomous Grand Challenge for End-to-End Driving [Model Deployment on Local]
1918 'Accelerate Everything,' NVIDIA CEO Says Ahead of COMPUTEX ... [Model Serving and Scaling]
1920 New Performance Optimizations Supercharge NVIDIA RTX AI PCs for Gamers, Creators and Developers [Model Serving and Scaling]
1922 NVIDIA Blackwell Platform Pushes the Boundaries of Scientific Computing [Model Serving and Scaling]
1923 Gen AI Healthcare Accelerated: Dozens of Companies Adopt Meta Llama 3 NIM [Model Deployment on Cloud]
1967 Maximizing Deep Learning Performance on NVIDIA Jetson Orin with DLA [Model Deployment on Local]
1972 Customizing AI Models: Deploy a Character Detection and Recognition Model with NVIDIA Triton [Model Deployment on Cloud]
1974 Scalable AI Sensor Streaming with Multi-GPU and Multi-Node Capabilities in NVIDIA Holoscan 0.6 [Model Serving and Scaling] [Model Deployment on Cloud]
1986 How to Build a Distributed Inference Cache with NVIDIA Triton and Redis [Model Serving and Scaling] [Model Deployment on Cloud] [Model Monitoring]
1993 Speeding Up Text-To-Speech Diffusion Models by Distillation [Model Compression]
1995 Deploying YOLOv5 on NVIDIA Jetson Orin with cuDLA: Quantization-Aware Training to Inference [Model Compression]
2047 Unlock Faster Image Generation in Stable Diffusion Web UI with NVIDIA TensorRT [Model Serving and Scaling]
2140 Fast-Track Computer Vision Deployments with NVIDIA DeepStream and Edge Impulse [Model Deployment on Local] [Model Deployment on Cloud]
2143 Available Now: NVIDIA AI Accelerated DGL and PyG Containers for GNNs [Model Serving and Scaling]
2162 Most Popular NVIDIA Technical Blog Posts of 2023: Generative AI, LLMs, Robotics, and Virtual Worlds Breakthroughs [Model Serving and Scaling]
2180 Accelerating Inference on End-to-End Workflows with H2O.ai and NVIDIA [Model Serving and Scaling]
2181 Develop ML and AI with Metaflow and Deploy with NVIDIA Triton Inference Server [Model Serving and Scaling] [Model Deployment on Cloud]
2182 New Stable Diffusion Models Accelerated with NVIDIA TensorRT [Model Deployment on Cloud]
2185 Experience Real-Time Audio and Video Communication with NVIDIA Maxine [Model Deployment on Cloud]
2192 Delivering Efficient, High-Performance AI Clouds with NVIDIA DOCA 2.5 [Model Deployment on Cloud]
2197 Build Vision AI Applications at the Edge with NVIDIA Metropolis Microservices and APIs [Model Deployment on Local] [Model Deployment on Cloud]
2215 Deploy an AI Coding Assistant with NVIDIA TensorRT-LLM and NVIDIA Triton [Model Deployment on Cloud] [Model Serving and Scaling]
2228 Benchmarking NVIDIA Spectrum-X for AI Network Performance, Now Available from Supermicro [Model Monitoring]
2229 Performance-Efficient Mamba-Chat from NVIDIA AI Foundation Models [Model Deployment on Cloud]
2254 NVIDIA TensorRT Accelerates Stable Diffusion Nearly 2x Faster with 8-bit Post-Training Quantization [Model Compression] [Model Deployment on Cloud]
2281 Breaking Barriers in Healthcare with New Models for Generative AI and Cellular Imaging [Model Deployment on Cloud]
2285 Powering Mission-Critical AI at the Edge with NVIDIA AI Enterprise IGX [Model Monitoring]
2289 Speed Up Your AI Development: NVIDIA AI Workbench Goes GA [Model Deployment on Cloud]
2357 Mistral Large and Mixtral 8x22B LLMs Now Powered by NVIDIA NIM and NVIDIA API [Model Serving and Scaling] [Model Deployment on Cloud]
2386 Regional LLMs SEA-LION and SeaLLM Serve Languages and Cultures of Southeast Asia [Model Deployment on Cloud]
2387 NVIDIA TensorRT 10.0 Upgrades Usability, Performance, and AI Model Support [Model Compression] [Model Serving and Scaling] [Model Deployment on Cloud]
2398 NVIDIA DeepStream 7.0 Milestone Release for Next-Gen Vision AI Development [Model Deployment on Cloud]
2439 Supercharge Generative AI Development with Firebase Genkit, Optimized by NVIDIA RTX GPUs [Model Deployment on Local]
2442 Accelerating Transformers with NVIDIA cuDNN 9 | NVIDIA Technical ... [Model Serving and Scaling]
2449 Enhancing the Apparel Shopping Experience with AI, Emoji-Aware OCR, and Snapchat’s Screenshop [Model Serving and Scaling] [Model Compression]
2451 Build Lifelike Digital Human Technology with NVIDIA ACE, Now Generally Available [Model Deployment on Cloud] [Model Deployment on Local]
2452 Maximum Performance and Minimum Footprint for AI Apps with NVIDIA TensorRT Weight-Stripped Engines [Model Compression] [Model Deployment on Cloud] [Model Deployment on Local] [Model Serving and Scaling]
2454 Streamline Development of AI-Powered Apps with NVIDIA RTX AI Toolkit for Windows RTX PCs [Model Deployment on Cloud]
2457 Building RAG Applications with NVIDIA NIM and Haystack on K8s [Model Deployment on Cloud] [Model Monitoring]
2463 Power Cloud-Native Microservices at the Edge with NVIDIA JetPack 6.0, Now GA [Model Deployment on Local]
2473 Introducing Grouped GEMM APIs in cuBLAS and More Performance Updates [Model Serving and Scaling]
2497 MediaTek Integrates NVIDIA TAO ToolKit with NeuroPilot SDK for Accelerated Development of Edge AI Applications in IoT [Model Deployment on Local]
2504 Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim [Model Deployment on Cloud] [Model Serving and Scaling]
2526 Generate Traffic Insights Using YOLOv8 and NVIDIA JetPack 6.0 [Model Deployment on Local]
2579 Power Your AI Projects with New NVIDIA NIMs for Mistral and Mixtral Models [Model Serving and Scaling] [Model Deployment on Cloud]
2591 Spotlight: HP 3D Printing Open Sources AI Surrogates for Additive Manufacturing Using NVIDIA Modulus [Model Deployment on Cloud]
2592 Develop Production-Grade Text Retrieval Pipelines for RAG with NVIDIA NeMo Retriever [Model Serving and Scaling] [Model Deployment on Cloud]
2622 Accelerating Hebrew LLM Performance with NVIDIA TensorRT-LLM [Model Serving and Scaling]
2638 Access to NVIDIA NIM Now Available Free to Developer Program Members [Model Deployment on Cloud]
2646 Optimizing llama.cpp AI Inference with CUDA Graphs | NVIDIA ... [Model Serving and Scaling]
2654 Computed Tomography Organ and Disease Segmentation Using the NVIDIA VISTA-3D NIM Microservice [Model Deployment on Cloud] [Model Serving and Scaling]
2655 A Deep Dive into the Latest AI Models Optimized with NVIDIA NIM [Model Deployment on Cloud]
2660 Empowering Energy Trading with MetDesk and NVIDIA Earth-2 [Model Serving and Scaling]
2707 How Amazon Shopping uses Amazon Rekognition Content Moderation to review harmful images in product reviews [Model Deployment on Cloud]
2821 Elevating the generative AI experience: Introducing streaming support in Amazon SageMaker hosting [Model Deployment on Cloud]
2836 How Amazon's Search M5 team optimizes compute resources and ... [Model Serving and Scaling]
2873 Deploy Generative AI Models on Amazon EKS | Containers [Model Deployment on Cloud]
2874 Maximizing GPU utilization with NVIDIA's Multi-Instance GPU (MIG ... [Model Serving and Scaling] [Model Deployment on Cloud]
2947 Ray Integration for AWS Trainium and AWS Inferentia is Now Available [Model Serving and Scaling]
2951 Future-proof Your AI at the Edge with AWS | AWS for Industries [Model Deployment on Local]
2954 Train and deploy ML models in a multicloud environment using Amazon SageMaker [Model Deployment on Cloud]
2999 Innovation for Inclusion: Hack.The.Bias with Amazon SageMaker [Model Deployment on Cloud]
3011 Philips Prototypes a Large-scale, Near-real-time Inference Platform to Extend Medical Imaging Using AWS [Model Deployment on Cloud]
3033 Create a Generative AI Gateway to allow secure and compliant consumption of foundation models [Model Serving and Scaling] [Model Deployment on Cloud]
3086 Create an HCLS document summarization application with Falcon using Amazon SageMaker JumpStart [Model Deployment on Cloud]
3132 New – No-code generative AI capabilities now available in Amazon SageMaker Canvas [Model Deployment on Cloud]
3133 Improve performance of Falcon models with Amazon SageMaker [Model Serving and Scaling]
3139 Automated Cloud-to-Edge Deployment of Industrial AI Models with Siemens Industrial Edge [Model Deployment on Local]
3207 How Veriff decreased deployment time by 80% using Amazon SageMaker multi-model endpoints [Model Serving and Scaling] [Model Deployment on Cloud]
3268 Intuitivo achieves higher throughput while saving on AI/ML costs using AWS Inferentia and PyTorch [Model Deployment on Cloud]
3298 Deploy and fine-tune foundation models in Amazon SageMaker JumpStart with two lines of code [Model Deployment on Cloud]
3306 Deploying Level 4 Digital Twin Self-Calibrating Virtual Sensors on AWS [Model Deployment on Cloud]
3400 Build a medical imaging AI inference pipeline with MONAI Deploy on AWS [Model Deployment on Cloud] [Model Serving and Scaling]
3419 Amazon Bedrock now provides access to Meta's Llama 2 Chat 13B ... [Model Deployment on Cloud]
3547 How Snorkel AI achieved over 40% cost savings by scaling machine learning workloads using Amazon EKS [Model Serving and Scaling] [Model Deployment on Cloud] [Model Monitoring]
3548 Text embedding and sentence similarity retrieval at scale with Amazon SageMaker JumpStart [Model Deployment on Cloud] [Model Serving and Scaling]
3551 How Amazon Music uses SageMaker with NVIDIA to optimize ML training and inference performance and cost [Model Serving and Scaling] [Model Deployment on Cloud]
3582 Optimizing costs for Amazon SageMaker Canvas with automatic shutdown of idle apps [Model Monitoring]
3598 Boost inference performance for LLMs with new Amazon SageMaker containers [Model Compression]
3624 OEMs accelerate automated feature development with new Amazon EC2 DL2q instances, powered by the Qualcomm Cloud AI 100 [Model Deployment on Cloud]
3669 Introducing Amazon SageMaker HyperPod to train foundation models at scale [Model Serving and Scaling] [Model Monitoring]
3670 Package and deploy classical ML and LLMs easily with Amazon SageMaker, part 2: Interactive User Experiences in SageMaker Studio [Model Deployment on Cloud] [Model Serving and Scaling]
3673 Reduce model deployment costs by 50% on average using the latest features of Amazon SageMaker [Model Serving and Scaling]
3678 Minimize real-time inference latency by using Amazon SageMaker routing strategies [Model Serving and Scaling] [Model Deployment on Cloud] [Model Monitoring]
3702 Enable faster training with Amazon SageMaker data parallel library [Model Serving and Scaling]
3798 Llama Guard is now available in Amazon SageMaker JumpStart [Model Deployment on Cloud]
3824 Mixtral-8x7B is now available in Amazon SageMaker JumpStart [Model Deployment on Cloud] [Model Serving and Scaling]
3825 Amazon SageMaker model parallel library now accelerates PyTorch FSDP workloads by up to 20% [Model Serving and Scaling] [Model Deployment on Cloud]
3836 Automating Quality Machine Inspection Infused with Edge AI and Digital Twins for Device Monitoring [Model Deployment on Local] [Model Serving and Scaling] [Model Monitoring]
3850 How to become a generative AI builder, starting at square one | AWS ... [Model Deployment on Cloud]
3876 Build an Amazon SageMaker Model Registry approval and promotion workflow with human intervention [Model Monitoring]
3878 Deploy a Slack gateway for Amazon Q Business | AWS Machine ... [Model Deployment on Cloud] [Model Serving and Scaling]
3895 AWS AI Backend Developed by Avahi Enables WittGen Biotechnology to Help Fight Cancer [Model Deployment on Cloud]
3928 Host the Whisper Model on Amazon SageMaker: exploring inference options [Model Deployment on Cloud] [Model Serving and Scaling]
3955 How anti-fraud systems use explainable AI to protect the betting and gaming industry [Model Deployment on Cloud]
4207 Streamline diarization using AI as an assistive technology: ZOO Digital’s story [Model Deployment on Cloud] [Model Serving and Scaling]
4217 Run ML inference on unplanned and spiky traffic using Amazon SageMaker multi-model endpoints [Model Serving and Scaling] [Model Deployment on Cloud]
4263 Generative AI-Powered Clinical Intelligence: Safely Driving Better Outcomes [Model Deployment on Cloud]
4356 Getting Started with Generative AI Using Hugging Face Platform on AWS [Model Deployment on Cloud] [Model Serving and Scaling]
4392 Federated learning on AWS using FedML, Amazon EKS, and Amazon SageMaker [Model Deployment on Cloud]
4418 Powering the generative AI era: What you missed at the AWS Public Sector Symposium Brussels [Model Deployment on Cloud]
4518 Scale LLMs with PyTorch 2.0 FSDP on Amazon EKS – Part 2 | AWS ... [Model Serving and Scaling] [Model Deployment on Cloud]
4530 Tackle complex reasoning tasks with Mistral Large, now available on Amazon Bedrock [Model Deployment on Cloud] [Model Serving and Scaling]
4531 Creating a User Activity Dashboard for Amazon CodeWhisperer [Model Monitoring]
4559 Quora achieved 3x lower latency and 25% lower Costs by modernizing model serving with Nvidia Triton on Amazon EKS [Model Serving and Scaling] [Model Compression]
4568 Nielsen Sports sees 75% cost reduction in video analysis with Amazon SageMaker multi-model endpoints [Model Serving and Scaling]
4577 Boost inference performance for Mixtral and Llama 2 models with new Amazon SageMaker containers [Model Deployment on Cloud] [Model Serving and Scaling] [Model Compression]
4622 Distributed training and efficient scaling with the Amazon SageMaker Model Parallel and Data Parallel Libraries [Model Serving and Scaling]
4660 Use Kubernetes Operators for new inference capabilities in Amazon SageMaker that reduce LLM deployment costs by 50% on average [Model Serving and Scaling] [Model Deployment on Cloud]
4667 Scale AI training and inference for drug discovery through Amazon EKS and Karpenter [Model Deployment on Cloud]
4695 Integrate HyperPod clusters with Active Directory for seamless multi-user login [Model Serving and Scaling] [Model Deployment on Cloud]
4721 Databricks DBRX is now available in Amazon SageMaker JumpStart [Model Deployment on Cloud]
4732 Deploy a Hugging Face (PyAnnote) speaker diarization model on Amazon SageMaker as an asynchronous endpoint [Model Deployment on Cloud]
4737 Run scalable, enterprise-grade generative AI workloads with Cohere Command R & R+, now available in Amazon Bedrock [Model Deployment on Cloud]
4751 Cohere Command R and R+ are now available in Amazon SageMaker JumpStart [Model Deployment on Cloud]
4763 Intelligent rig operations classification with HITL on AWS | AWS for ... [Model Deployment on Cloud]
4781 Accelerate drug discovery with NVIDIA BioNeMo Framework on Amazon EKS [Model Deployment on Cloud]
4782 Amazon Personalize launches new recipes supporting larger item catalogs with lower latency [Model Deployment on Cloud]
4783 AWS Inferentia and AWS Trainium deliver lowest cost to deploy Llama 3 models in Amazon SageMaker JumpStart [Model Deployment on Cloud] [Model Serving and Scaling]
4803 Deploy LLMs in AWS GovCloud (US) Regions using Hugging Face Inference Containers [Model Deployment on Cloud] [Model Serving and Scaling]
4867 Accelerate NLP inference with ONNX Runtime on AWS Graviton processors [Model Serving and Scaling]
4937 Optimized for low-latency workloads, Mistral Small now available in Amazon Bedrock [Model Deployment on Cloud]
4942 Accelerate Mixtral 8x7B pre-training with expert parallelism on Amazon SageMaker [Model Serving and Scaling] [Model Deployment on Cloud]
4962 Large scale training with NVIDIA NeMo Megatron on AWS ParallelCluster using P5 instances [Model Deployment on Cloud]
5004 Falcon 2 11B is now available on Amazon SageMaker JumpStart [Model Deployment on Cloud] [Model Serving and Scaling]
5074 Get started quickly with AWS Trainium and AWS Inferentia using AWS Neuron DLAMI and AWS Neuron DLC [Model Deployment on Cloud]
5082 Sprinklr improves performance by 20% and reduces cost by 25% for machine learning inference on AWS Graviton3 [Model Deployment on Cloud] [Model Serving and Scaling] [Model Monitoring]
5170 Maximize your Amazon Translate architecture using strategic caching layers [Model Serving and Scaling]
5172 Manage Amazon SageMaker JumpStart foundation model access with private hubs [Model Deployment on Cloud]
5187 Improve visibility into Amazon Bedrock usage and performance with Amazon CloudWatch [Model Monitoring]
5192 Scale and simplify ML workload monitoring on Amazon EKS with AWS Neuron Monitor container [Model Monitoring] [Model Serving and Scaling]
5219 Build generative AI applications on Amazon Bedrock — the secure, compliant, and responsible foundation [Model Monitoring]
5259 Accelerated PyTorch inference with torch.compile on AWS Graviton processors [Model Deployment on Cloud]
5308 Achieve up to ~2x higher throughput while reducing costs by up to ~50% for generative AI inference on Amazon SageMaker with the new inference optimization toolkit – Part 2 [Model Compression] [Model Serving and Scaling]
5439 Llama 3.1 models are now available in Amazon SageMaker JumpStart [Model Deployment on Cloud]
5461 Deploying generative AI applications with NVIDIA NIMs on Amazon EKS [Model Deployment on Cloud] [Model Serving and Scaling]
5463 Amazon SageMaker inference launches faster auto scaling for generative AI models [Model Serving and Scaling] [Model Monitoring]
5469 Boosting Salesforce Einstein's code generating model performance ... [Model Serving and Scaling] [Model Deployment on Cloud]
5506 Node problem detection and recovery for AWS Neuron nodes within Amazon EKS clusters [Model Monitoring]
5560 Intuit uses Amazon Bedrock and Anthropic's Claude to explain taxes ... [Model Deployment on Cloud] [Model Serving and Scaling]
5613 Faster LLMs with speculative decoding and AWS Inferentia2 | AWS ... [Model Serving and Scaling]
5666 How Cisco accelerated the use of generative AI with Amazon SageMaker Inference [Model Deployment on Cloud] [Model Serving and Scaling]
5674 Cisco achieves 50% latency improvement using Amazon SageMaker Inference faster autoscaling feature [Model Serving and Scaling]
5714 Neural network pruning with combinatorial optimization [Model Compression]
5733 Touch and see Google Cloud infrastructure in the Hardware-verse ... [Model Serving and Scaling] [Model Deployment on Cloud]
5740 Google Distributed Cloud: new AI and data services | Google Cloud ... [Model Deployment on Cloud] [Model Deployment on Local]
5792 Performance deep dive of Gemma on Google Cloud | Google Cloud ... [Model Deployment on Cloud]
5794 Google Cloud's container platform for the next decade of AI | Google ... [Model Deployment on Cloud]
5899 IBM Watson and ESPN use AI to transform fantasy football data [Model Deployment on Cloud]
6028 Speed, scale and trustworthy AI on IBM Z with Machine Learning for ... [Model Serving and Scaling]
6070 Introducing Azure NC H100 v5 VMs for mid-range AI and HPC workloads [Model Deployment on Cloud]
6112 Annual Roundup on AI Infrastructure Breakthroughs for 2023 [Model Deployment on Cloud] [Model Serving and Scaling]
6165 Discover the Power of SAP AI Core: The New Learning Journey Now Available! [Model Deployment on Cloud]
6166 SAP AI Core - Realtime inference with SAP HANA Machine Learning - SAP ... [Model Serving and Scaling]
6175 SAP AI Core - Scheduling SAP HANA Machine Learning - SAP ... [Model Deployment on Cloud]
6222 AI in SAP BTP: Q3 2023 Highlights – SAP AI Business Services, SAP AI Core and SAP AI Launchpad - SAP ... [Model Serving and Scaling]
6339 It's Christmas! Ollama+Phi-2 on SAP AI Core - SAP Community [Model Serving and Scaling]
6519 Deployment of Seamless M4T v2 models on SAP AI Core - SAP ... [Model Deployment on Cloud] [Model Serving and Scaling]
6531 Leveraging SAP AI Core APIs to Build your own AI Powered Apps - SAP ... [Model Deployment on Cloud]
6532 A Comprehensive Overview of Intelligent Scenario Lifecycle Management (ISLM) [Model Serving and Scaling]
6533 Unlock innovation and transformation with expanded SAP BTP and SAP AI services on Microsoft Azure - SAP ... [Model Deployment on Cloud]
6534 SAP AI Core Static Deployment URL - SAP Community [Model Deployment on Cloud] [Model Monitoring] [Model Serving and Scaling]
6546 CI/CD with SAP AI Core - SAP Community [Model Serving and Scaling]
6557 SAP AI Core is All You Need | 7. Deploying Language Models for Text Generation - SAP ... [Model Deployment on Cloud] [Model Serving and Scaling]
6624 Mistral-7B in OCI Data Science: An overview and deployment guide [Model Deployment on Cloud]
6659 Simplify your model monitoring and MLOps with OML Model Monitoring UI [Model Monitoring]
6666 Accelerating telco innovation by leveraging power of GPUs on Oracle Cloud Infrastructure for enhanced customer experiences and operational efficiency [Model Serving and Scaling] [Model Deployment on Cloud]
6671 Driving Government Innovation: Oracle Cloud Infrastructure Supercluster Leverages NVIDIA AI in Oracle US Government Cloud [Model Deployment on Cloud]
6691 Deploy Llama 3.1 405B in OCI Data Science [Model Deployment on Cloud]
6695 New to OCI AI Infrastructure: Midrange Bare Metal Compute with NVIDIA L40S and VMs with NVIDIA H100/A100 [Model Deployment on Cloud]
6804 Hyperforce: The Trust, Innovation, and Customer Success Enabler [Model Deployment on Cloud]
7027 Unleashing Creativity Exploring the Power of Generative AI on Cloud [Model Deployment on Cloud]
7028 Quickly Deploy Open Source LLMs in EAS - Alibaba Cloud Community [Model Deployment on Cloud] [Model Serving and Scaling]
7029 Deploy a RAG-Based LLM Chatbot in EAS - Alibaba Cloud Community [Model Serving and Scaling]
7032 Accelerating Large Language Model Inference: High-performance TensorRT-LLM [Model Compression]
7036 Alibaba Cloud Launches Tongyi Qianwen 2.0 and Industry-specific Models to [Model Deployment]
7038 Alibaba Cloud Unveils Serverless Solution to Harness Gen-AI Capabilities for [Model Deployment on Cloud]
7042 Best Practices for Large Model Inference in ACK: TensorRT-LLM [Model Deployment on Cloud]
7057 Tongyi Bailian - Model Studio with Chinese Version of Alibaba Cloud [Model Deployment on Cloud]
7059 AI Container Image Deployment: Stable Diffusion - Alibaba Cloud ... [Model Deployment on Cloud]
7065 Rapid Deployment of AI Painting with WebUI on PAI-EAS using Alibaba Cloud [Model Deployment on Cloud]
7067 Quick Start the AI Model on the Alibaba Cloud Model Studio [Model Deployment on Cloud]
7078 AI Container Image Deployment: Qwen-Audio-Chat - Alibaba Cloud ... [Model Deployment on Cloud] [Model Serving and Scaling]
7080 AI Container Image Deployment: Qwen-VL-Chat - Alibaba Cloud ... [Model Deployment on Cloud] [Model Serving and Scaling]
7099 TePDist (an HLO-Based Fully Automatic Distributed System) Has Opened Its Source [Model Serving and Scaling] [Model Compression]
7100 Quickly Deploy Stable Diffusion for Text-to-Image Generation in EAS [Model Deployment on Cloud]
7101 Deploying Pre-trained Models on Alibaba Cloud ECS Using Hugging Face [Model Deployment on Cloud]
7108 DeepRec: A Training and Inference Engine for Sparse Models in Large-Scale [Model Serving and Scaling] [Model Compression] [Model Deployment on Cloud]

Prompt Construction

id post tags
374 Build generative AI chatbots using prompt engineering with Amazon Redshift and Amazon Bedrock [Prompt Engineering]
644 Generate synthetic data with BigQuery DataFrames and LLMs ... [Automated Prompt Generation]
1172 SAMMO: A general-purpose framework for prompt optimization [Automated Prompt Generation] [Prompt Engineering]
1173 LLMLingua: Innovating LLM efficiency with prompt compression [Automated Prompt Generation] [Prompt Engineering]
1213 Steering at the Frontier: Extending the Power of Prompting [Prompt Engineering]
1256 Generate Process Models with GenAI - SAP Community [Prompt Engineering]
1418 Extending SaaS by AI/ML - Part 4: Using SaaS data with LangChain Prompt Templates for Few-Shot learning [Prompt Engineering]
2578 Develop Generative AI-Powered Visual AI Agents for the Edge [Prompt Engineering]
3601 Amazon Redshift adds new AI capabilities, including Amazon Q, to boost efficiency and productivity [Prompt Engineering]
3754 Improve your Stable Diffusion prompts with Retrieval Augmented Generation [Prompt Engineering] [Automated Prompt Generation]
4401 Unlock the potential of generative AI in industrial operations | AWS ... [Prompt Engineering]

System Architecture and Orchestration

id post tags
6 'We Created a Processor for the Generative AI Era,' NVIDIA CEO Says [Platforms/Tools/Studios]
7 How NVIDIA AI Foundry Lets Enterprises Forge Custom Generative AI Models [Platforms/Tools/Studios]
18 Software Developers Launch OpenUSD and Generative AI-Powered Product Configurators Built on NVIDIA Omniverse [Model and Prompt Chaining]
20 A Mighty Meeting: Generative AI, Cybersecurity Connect at RSA [Platforms/Tools/Studios] [Guardrails]
23 NVIDIA and Siemens Bring Immersive Visualization and Generative AI to Industrial Design and Manufacturing [Workflow Orchestration]
30 NVIDIA Unveils Reference Architecture for AI Cloud Providers [Platforms/Tools/Studios]
32 NVIDIA to Acquire GPU Orchestration Software Provider Run:ai [Workflow Orchestration]
38 Boom in AI-Enabled Medical Devices Transforms Healthcare [AI Agent]
51 SoftServe and Continental Drive Digitalization With OpenUSD and Generative AI [AI Agent]
56 Democratizing Industrial Digital Twins With Generative AI and ... [Model and Prompt Chaining]
61 AI Decoded at GTC: Developer Tools and Apps Accelerating AI ... [Platforms/Tools/Studios]
63 Generative AI Developers Harness NVIDIA Technologies to Transform In-Vehicle Experiences [Platforms/Tools/Studios] [AI Agent]
65 NVIDIA Supercharges Digital Marketing With Greater Control Over Generative AI [Platforms/Tools/Studios] [Model and Prompt Chaining] [Workflow Orchestration]
66 NVIDIA Isaac Taps Generative AI for Manufacturing and Logistics Applications [Workflow Orchestration]
79 WPP and NVIDIA Omniverse Help The Coca-Cola Company Scale Generative AI Content That Pops With Brand Authenticity [Workflow Orchestration] [Platforms/Tools/Studios]
88 Broadcasting Breakthroughs: NVIDIA Holoscan for Media, Available Now, Transforms Live Media With Easy AI Integration [Platforms/Tools/Studios]
95 Streaming Ahead: Broadcasters Enhance Creative Workflows and Content Production With NVIDIA Technologies [System Architecture]
103 Getting Started with Large Language Models for Enterprise Solutions [Platforms/Tools/Studios]
104 Develop Custom Enterprise Generative AI with NVIDIA NeMo [Platforms/Tools/Studios]
132 Software-Defined Broadcast with NVIDIA Holoscan for Media [Platforms/Tools/Studios]
145 Build an LLM-Powered Data Agent for Data Analysis | NVIDIA ... [Workflow Orchestration]
155 Translate Your Enterprise Data into Actionable Insights with NVIDIA NeMo Retriever [Model and Prompt Chaining]
174 Enabling Greater Patient-Specific Cardiovascular Care with AI Surrogates [Platforms/Tools/Studios]
177 Build an LLM-Powered API Agent for Task Execution | NVIDIA ... [Workflow Orchestration]
190 Applying Mixture of Experts in LLM Architectures | NVIDIA Technical ... [Model and Prompt Chaining]
205 Watch: Meta's engineers on building network infrastructure for AI ... [Workflow Orchestration] [Platforms/Tools/Studios]
208 Arcadia: An end-to-end AI system performance simulator ... [Platforms/Tools/Studios]
211 RoCE networks for distributed AI training at scale - Engineering at ... [Model and Prompt Chaining]
222 Adobe Express at MAX 2023: The Next Step for Our Open Developer Platform [Platforms/Tools/Studios]
310 How Alexa knows “peanut butter” is one shopping-list item, not two [AI Agent]
313 AWS VP of AI and data on computer vision research at Amazon [Platforms/Tools/Studios]
320 Generative AI partner offerings in AWS Marketplace: Core & Infrastructure Software [Platforms/Tools/Studios]
335 Empowering everyone with GenAI to rapidly build, customize, and deploy apps securely: Highlights from the AWS New York Summit [Model and Prompt Chaining]
338 Conceptual design using generative AI and CFD simulations on AWS [Workflow Orchestration] [Platforms/Tools/Studios]
339 Achieve DevOps maturity with BMC AMI zAdviser Enterprise and Amazon Bedrock [Workflow Orchestration]
342 Why AWS Partners Are Excited About the Latest Innovations in Generative AI on AWS [Platforms/Tools/Studios] [AI Agent] [Model and Prompt Chaining]
349 Emerging Architecture Patterns for Integrating IoT and generative AI on AWS [Model and Prompt Chaining] [AI Agent] [Workflow Orchestration]
358 Building an AI simulation assistant with agentic workflows | AWS ... [Model and Prompt Chaining]
364 How 20 Minutes empowers journalists and boosts audience engagement with generative AI on Amazon Bedrock [Model and Prompt Chaining]
368 Automate chatbot for document and data retrieval using Agents and ... [Model and Prompt Chaining]
379 Build generative AI apps using AWS Step Functions and Amazon Bedrock [Model and Prompt Chaining]
381 Germany's International University of Applied Sciences automates ... [Platforms/Tools/Studios]
383 Learn how Amazon Ads created a generative AI-powered image generation capability using Amazon SageMaker [Workflow Orchestration] [Model and Prompt Chaining] [Platforms/Tools/Studios]
390 AWS AppFabric helps application developers personalize their generative AI assistant with context from multiple applications [AI Agent]
391 Transforming Business Experiences: The Impact of Amazon Q and Generative BI for AWS Partners [AI Agent] [Platforms/Tools/Studios] [Workflow Orchestration]
408 Building a generative AI reservoir simulation assistant with Stone Ridge Technology [AI Agent] [Platforms/Tools/Studios]
413 Build generative AI applications with Amazon Titan Text Premier, Amazon Bedrock, and AWS CDK [Model and Prompt Chaining] [Workflow Orchestration]
414 Learn how to build and deploy tool-using LLM agents using AWS SageMaker JumpStart Foundation Models [AI Agent] [Workflow Orchestration]
415 Medical content creation in the age of generative AI | AWS Machine ... [AI Agent]
440 Accelerating code migrations with AI [Platforms/Tools/Studios]
442 Autonomous visual information seeking with large language models [Workflow Orchestration] [Model and Prompt Chaining]
461 Responsible AI at Google Research: User Experience Team [Platforms/Tools/Studios]
475 LANISTR: Multimodal learning from structured and unstructured data [Platforms/Tools/Studios]
548 Google AI on Android: Work smarter wherever you are [AI Agent]
621 GKE and NVIDIA NeMo framework to train generative AI models ... [Platforms/Tools/Studios]
654 Add gen AI to your apps with BigQuery and Document AI integration ... [Platforms/Tools/Studios]
730 How two software companies are using IBM watsonx for their ... [Model and Prompt Chaining]
737 Watsonx: A game changer for embedding generative AI into ... [Platforms/Tools/Studios] [Model and Prompt Chaining] [AI Agent] [Workflow Orchestration]
745 How to accelerate your data monetization strategy with data ... [Platforms/Tools/Studios]
773 How IBM helps Wimbledon use generative AI to drive personalised ... [Platforms/Tools/Studios] [Model and Prompt Chaining]
788 The AI Assistant for everyone: watsonx Orchestrate combines ... [Platforms/Tools/Studios] [AI Agent] [Workflow Orchestration]
839 LLMs revolutionized AI: LLM-based AI agents are what's next - IBM ... [Workflow Orchestration]
859 Bringing Generative AI to Semiconductor and Electronics Design [AI Agent] [Workflow Orchestration]
860 Navigating the Generative AI Landscape with Azure AI Services: Insights from Customer Round Table [Workflow Orchestration] [Model and Prompt Chaining]
864 Develop and deploy generative AI apps responsibly with Azure AI ... [Platforms/Tools/Studios] [Workflow Orchestration]
870 Semantic Kernel-Powered OpenAI Plugin Development Lifecycle [Model and Prompt Chaining]
871 Innovate with Azure AI Studio AMA: Unleashing Generative AI for Enterprise Solutions [Platforms/Tools/Studios]
872 The Future of Agent Frameworks: TaskWeaver and Microsoft ... [AI Agent] [Model and Prompt Chaining]
873 Microsoft Semantic Kernel and AutoGen: Open Source Frameworks for AI Solutions [Workflow Orchestration] [AI Agent] [Model and Prompt Chaining] [Platforms/Tools/Studios]
874 Microsoft Learn AI Skills Challenge [AI Agent]
881 GenAI Mastery: Crafting Robust Enterprise Solutions with PromptFlow and LangChain [Model and Prompt Chaining]
887 AI Apps: Driving innovation from development to production [Platforms/Tools/Studios]
893 Building Intelligent Applications with Local RAG in .NET and Phi-3: A Hands-On Guide [Workflow Orchestration]
895 Deploy Semantic Kernel with Bot Framework [Workflow Orchestration]
900 How to use Semantic Kernel Bot in-a-box to interact with data using natural language & AI [AI Agent] [Model and Prompt Chaining]
904 Ignite 2023: What's new in Azure AI Platforms – Charting the Future ... [Platforms/Tools/Studios] [Model and Prompt Chaining]
905 Year in review: How Microsoft Copilot, Microsoft Teams, and our partners built a stronger ecosystem [Platforms/Tools/Studios]
908 Extending Semantic Kernel using OllamaSharp for Chat and Text Completion [AI Agent]
909 Redesigning a Retail Copilot with Open Source Models - Microsoft ... [AI Agent] [Platforms/Tools/Studios]
918 Extending Microsoft Copilot for Microsoft 365: A guide for enterprises and ISVs [Platforms/Tools/Studios] [Workflow Orchestration]
923 Revolutionizing Businesses with Virtual AI Agents - Microsoft ... [Workflow Orchestration]
948 Azure AI and Microsoft Fabric Integration: Driving AI Innovation ... [Platforms/Tools/Studios] [AI Agent]
967 Microsoft Ignite 2023: AI transformation and the technology driving change [AI Agent]
968 Manufacturing for tomorrow: Microsoft announces new industrial AI innovations from the cloud to the factory floor [AI Agent]
975 Empowering every scientist with AI-augmented scientific discovery [Model and Prompt Chaining]
997 Accelerating telco transformation in the era of AI - The Official ... [AI Agent] [Workflow Orchestration]
1002 New data and AI solutions in Microsoft Cloud for Sustainability help move organizations from pledges to progress [AI Agent]
1046 GUEST POST: Getting Started with Semantic Kernel for LangChain users [Workflow Orchestration]
1047 Anticipating the future of physical systems design - Azure Government [Workflow Orchestration]
1050 Building your own DB Copilot for Azure SQL with Azure OpenAI GPT-4 [AI Agent] [Model and Prompt Chaining]
1051 Step by Step guide to develop AI Multi-Agent system using Microsoft Semantic Kernel and GPT-4o [AI Agent] [Workflow Orchestration]
1054 Azure OpenAI service is now available in Azure Government - Azure ... [Platforms/Tools/Studios]
1056 Building the next era of AI: Teams AI Library and API message extensions | Ignite 2023 [Platforms/Tools/Studios]
1060 Decoding AI: Part 6, Creating boundary conditions in generative AI [Guardrails]
1071 Customer Case Study: preezie's AI Journey with Microsoft Semantic ... [AI Agent] [Model and Prompt Chaining]
1074 Guest Blog: Microsoft MVP Developed Course on Understanding Semantic Kernel [Model and Prompt Chaining] [Workflow Orchestration] [AI Agent]
1075 Maximizing joy and minimizing toil with great developer experiences [AI Agent] [Workflow Orchestration] [Model and Prompt Chaining]
1084 Use Semantic Kernel to create a Restaurant Bookings Sample with Python [Model and Prompt Chaining] [AI Agent]
1085 Build intelligent apps for Microsoft 365 with Teams Toolkit [AI Agent]
1091 Introducing the v1.0.0 Beta1 for the .NET Semantic Kernel SDK [Workflow Orchestration]
1096 Building AI-powered Microsoft Copilot with SignalR and other open-source tools [Model and Prompt Chaining] [Platforms/Tools/Studios] [Workflow Orchestration]
1098 Customer Case Study: Visma Spcs Improves Customer Experience with Semantic Kernel [Model and Prompt Chaining] [Workflow Orchestration]
1100 Speech-to-speech conversing with OpenAI on Android - Surface ... [Model and Prompt Chaining]
1103 Java 1.0 Release Candidate for Semantic Kernel now available [Platforms/Tools/Studios] [Model and Prompt Chaining]
1109 Opportunities for partners in the Microsoft Teams AI ecosystem [AI Agent] [Platforms/Tools/Studios]
1119 Next steps: how to rapidly reach the potential of AUKUS - Azure ... [Platforms/Tools/Studios] [Model and Prompt Chaining]
1120 Java SDK for Semantic Kernel 1.0.0-rc2 Released - Add AI ... [Platforms/Tools/Studios]
1128 Enhanced Automation in Python: Auto Tool Calling for OpenAI Models in the Semantic Kernel SDK [Workflow Orchestration]
1129 Build 2024 Recap: Bridging the chasm between your ML and app devs [Platforms/Tools/Studios] [Model and Prompt Chaining] [Workflow Orchestration]
1130 Comprehensive Document Translation Solution - Azure Government [Platforms/Tools/Studios]
1134 Making Plans with Semantic Kernel: Implementing the Microsoft Graph Plugin [AI Agent] [Platforms/Tools/Studios] [Workflow Orchestration]
1146 AI Controller Interface: Generative AI with a lightweight, LLM-integrated VM [Model and Prompt Chaining] [Platforms/Tools/Studios] [Workflow Orchestration]
1150 Empowering NGOs with generative AI in the fight against human trafficking [Workflow Orchestration]
1158 AutoGen: Enabling next-generation large language model applications [Workflow Orchestration] [Model and Prompt Chaining] [AI Agent]
1166 Introducing AutoGen Studio: A low-code interface for building multi-agent workflows [Workflow Orchestration] [AI Agent]
1167 GENEVA uses large language models for interactive game narrative design [Model and Prompt Chaining]
1194 TaskWeaver: A code-first agent framework for efficient data analytics and domain adaptation [Workflow Orchestration]
1196 Players, creators, and AI collaborate to build and expand rich game narratives [Model and Prompt Chaining]
1208 Using AI for tiered cloud platform operation - Microsoft Research [AI Agent] [Platforms/Tools/Studios]
1221 Tracing the path to self-adapting AI agents - Microsoft Research [Workflow Orchestration] [AI Agent]
1230 SIGMA: An open-source mixed-reality system for research on physical task assistance [Workflow Orchestration]
1244 How SAP's Generative AI Architecture Redefines Business Applications - SAP ... [Workflow Orchestration]
1247 Generative AI Hub - OUT NOW! - SAP Community [Platforms/Tools/Studios] [Model and Prompt Chaining]
1250 Demystifying Joule - SAP's New Generative AI Assis... - SAP ... [AI Agent]
1251 Augmenting SAP BTP Use Cases with AI Foundation: A Deep Dive into the Generative AI Hub [Platforms/Tools/Studios]
1257 AI Foundation, SAP's all-in-one AI toolkit for dev... - SAP Community [Platforms/Tools/Studios]
1259 GenAI Reference Solution Architecture on SAP Business Technology Platform - SAP ... [Model and Prompt Chaining]
1261 How SAP's Generative AI Hub facilitates embedded, ... - SAP ... [Platforms/Tools/Studios] [Model and Prompt Chaining] [Workflow Orchestration]
1262 GenAI Mail Insights - Leveraging the generative AI hub in SAP AI Core to improve customer support - SAP Community [Model and Prompt Chaining] [Workflow Orchestration]
1268 SAP TechEd 2023 Executive Keynote - Highlights - SAP Community [Platforms/Tools/Studios]
1274 AIGC Innovative Experiment Integration with SAP Analytics Cloud for Intelligent Decision-making - SAP ... [AI Agent]
1277 Improving Time Management in SAP S/4HANA Cloud: A GenAI Solution [AI Agent]
1278 SAP TechEd Shenanigans: Tales of Tech, Learning & Frolics [AI Agent]
1279 CAP LLM Plugin – Empowering Developers for rapid Gen AI-CAP App Development - SAP ... [Platforms/Tools/Studios]
1281 Harness the Power of Generative AI for Enterprise ... - SAP Community [Platforms/Tools/Studios] [Workflow Orchestration]
1282 SAP Partners unleash Business AI potential at global Hack2Build - SAP ... [Platforms/Tools/Studios] [Workflow Orchestration]
1283 Unlocking AI Potential with SAP Business Technology Platform (BTP) - SAP ... [System Architecture]
1284 Learn how to leverage Generative AI and SAP Build Process ... [Workflow Orchestration]
1285 AI Foundation on SAP BTP: Q4 2023 Release Highlights - SAP ... [Platforms/Tools/Studios]
1291 Integrating AI with SAPUI5 Fiori Apps: Part 2 - Building a Text Summarizer - SAP Community [AI Agent]
1292 Infusing GenAI in BTP Cockpit - SAP Community [AI Agent]
1293 Product Reviews Analysis using Generative AI and No Code Tools - SAP ... [Model and Prompt Chaining] [Workflow Orchestration]
1302 Enterprise Automation: Deep Dive on new Generative AI capabilities! [Model and Prompt Chaining]
1303 Top 9 takeaways #SAPSapphire 2024 - SAP Community [AI Agent]
1307 Integrating AI with SAPUI5 Fiori Apps: Part 1 - Concept - SAP Community [System Architecture]
1317 Predict, Personalize, Prosper: BTP AI Capabilities Redefining Retail Intelligence - Part 3/3 - SAP Community [Platforms/Tools/Studios]
1329 The Power Duo: SAP HANA Cloud and SAP Datasphere Enabling IDA & Gen-AI Driven Solutions - SAP ... [Model and Prompt Chaining]
1336 SAP BTP Innobytes – January 2024 - SAP Community [Platforms/Tools/Studios]
1350 Re-imagining edge data analysis with LLMs and open-source technologies [Workflow Orchestration]
1357 Developing AI applications with OCI Generative AI and LangChain [Model and Prompt Chaining]
1361 Creating LLM powered applications using OCI Generative AI [Model and Prompt Chaining]
1369 Leveraging LangChain and LLM for Seamless Oracle Database Queries [Model and Prompt Chaining]
1381 Accelerate innovation with enterprise data, OCI Generative AI, and enhanced security [Model and Prompt Chaining]
1388 Introducing Select AI - Natural Language to SQL Generation on Autonomous Database [Workflow Orchestration]
1466 Top Takeaways from the Cisco Live 2024 DevNet Zone: AI, Programmability, and More [AI Agent]
1472 Using the Power of Artificial Intelligence to Augment Network Automation [Model and Prompt Chaining]
1561 How to Use Generative AI for App Development | Salesforce [Model and Prompt Chaining]
1611 6 Ways To Try Out the Latest AI and Data Innovations From Salesforce [AI Agent] [Model and Prompt Chaining]
1627 Build AI Apps for IT Fast — Here's Your Roadmap | Salesforce [Platforms/Tools/Studios]
1639 A big data solution finally arrives, in the form of AI | Salesforce [AI Agent]
1658 From Copilot to CoOrchestration [Workflow Orchestration] [Model and Prompt Chaining]
1659 BannerGen: A Library for Multi-Modality Banner Generation [Model and Prompt Chaining]
1787 A Mine-Blowing Breakthrough: Open-Ended AI Agent Voyager Autonomously Plays ‘Minecraft’ [AI Agent]
1808 Bringing Personality to Pixels, Inworld Levels Up Game Characters Using Generative AI [Model and Prompt Chaining]
1861 Staying in Sync: NVIDIA Combines Digital Twins With Real-Time AI for Industrial Automation [Model and Prompt Chaining]
1870 Johnson & Johnson MedTech Works With NVIDIA to Broaden AI's ... [Platforms/Tools/Studios]
1913 Taiwan Electronics Giants Drive Industrial Automation With NVIDIA Metropolis and NIM [AI Agent]
1953 New NVIDIA NIM Microservices Bring Generative AI to Digital Environments [Model and Prompt Chaining] [Workflow Orchestration]
1977 Designing Deep Networks to Process Other Deep Networks [Model and Prompt Chaining]
1991 Fast Track Data Center Workloads and AI Applications with NVIDIA DOCA 2.2 [Platforms/Tools/Studios]
2068 Differentiable Slang: A Shading Language for Renderers That Learn [Platforms/Tools/Studios]
2084 Whole Human Brain Neuro-Mapping at Cellular Resolution on NVIDIA DGX [Workflow Orchestration]
2104 Accelerating Neurosymbolic AI with RAPIDS and Prometheux Vadalog Parallel [Platforms/Tools/Studios]
2117 Boost Meeting Productivity with AI-Powered Note-Taking and Summarization [Model and Prompt Chaining]
2118 Accelerate AI Workflows for 3D Medical Imaging with NVIDIA MONAI Cloud APIs [Workflow Orchestration]
2142 Create Lifelike Avatars with AI Animation and Speech Features in NVIDIA ACE [Platforms/Tools/Studios] [Workflow Orchestration]
2176 Spotlight: Convai Reinvents Non-Playable Character Interactions [AI Agent] [Model and Prompt Chaining]
2177 Building Lifelike Digital Avatars with NVIDIA ACE Microservices [Model and Prompt Chaining]
2232 Spotlight: HOMEE AI Delivers AI-Powered Spatial Planning to Your Living Room [Workflow Orchestration]
2280 Generative AI for Digital Human Technologies and New AI-powered NVIDIA RTX Lighting [AI Agent] [Model and Prompt Chaining]
2288 Scale AI-Enabled Robotics Development Workloads with NVIDIA OSMO [Workflow Orchestration]
2358 Spotlight: Continental and SoftServe Deliver Generative AI-Powered Virtual Factory Solutions with OpenUSD [AI Agent] [Workflow Orchestration] [Model and Prompt Chaining]
2374 Democratizing AI Workflows with Union.ai and NVIDIA DGX Cloud [Workflow Orchestration]
2395 Develop Secure, Reliable Medical Apps with RAG and NVIDIA NeMo Guardrails [Guardrails]
2438 Generative AI Agents Developer Contest: Top Tips for Getting Started [Workflow Orchestration]
2455 Create, Design, and Deploy Robotics Applications Using New NVIDIA Isaac Foundation Models and Workflows [Model and Prompt Chaining]
2456 Building Safer LLM Apps with LangChain Templates and NVIDIA NeMo Guardrails [Guardrails]
2489 Pegatron Simulates and Optimizes Factory Operations with AI-Enabled Digital Twins [Platforms/Tools/Studios]
2491 Optimize Processes for Large Spaces with the Multi-Camera Tracking Workflow [AI Agent]
2501 Video: Talk to Your Supply Chain Data Using NVIDIA NIM | NVIDIA ... [Workflow Orchestration]
2575 Building an AI Agent for Supply Chain Optimization with NVIDIA NIM and cuOpt [AI Agent] [Workflow Orchestration]
2580 Developing Product Configurators with OpenUSD | NVIDIA ... [Workflow Orchestration] [Model and Prompt Chaining]
2585 Build an Agentic RAG Pipeline with Llama 3.1 and NVIDIA NeMo Retriever NIMs [Workflow Orchestration]
2620 Integrate Generative AI into OpenUSD Workflows Using New NVIDIA Omniverse Developer Tools [Platforms/Tools/Studios] [Workflow Orchestration]
2625 Build VLM-Powered Visual AI Agents Using NVIDIA NIM and NVIDIA VIA Microservices [Workflow Orchestration]
2626 Building AI Agents with NVIDIA NIM Microservices and LangChain [AI Agent] [Workflow Orchestration]
2633 Building Spatial Intelligence from Real-World 3D Data Using Deep-Learning Framework fVDB [Platforms/Tools/Studios] [Model and Prompt Chaining]
2647 Advancing Telepresence and Next-Generation Digital Human Technology with NVIDIA Maxine [Platforms/Tools/Studios]
2689 Scaling intelligent document processing workflows with AWS AI services [Workflow Orchestration]
2734 Unlocking efficiency: Harnessing the power of Selective Execution in Amazon SageMaker Pipelines [Workflow Orchestration]
2785 Streamlining Prior Authorization with Treatline's Generative AI ... [Platforms/Tools/Studios]
2860 Build a Conversational AI app to Interact with AWS using AWS Amplify [AI Agent]
2948 Accenture Extends Generative AI Capabilities to Accelerate ... [AI Agent] [Workflow Orchestration]
3100 Amplifying Business Process Automations with UiPath and Amazon SageMaker [Model and Prompt Chaining]
3175 Build a multi-tenant chatbot with RAG using Amazon Bedrock and Amazon EKS [Workflow Orchestration]
3308 Build a generative AI-powered agent assistance application using Amazon Aurora and Amazon SageMaker JumpStart [AI Agent] [Workflow Orchestration]
3333 How Reveal's Logikcull used Amazon Comprehend to detect and ... [Platforms/Tools/Studios]
3355 Use generative AI to increase agent productivity through automated call summarization [Model and Prompt Chaining]
3456 Transform enterprise search and knowledge discovery with Glean and Amazon Bedrock [Workflow Orchestration]
3458 Principal Financial Group uses AWS Post Call Analytics solution to extract omnichannel customer insights [Workflow Orchestration]
3567 Build well-architected IDP solutions with a custom lens – Part 1: Operational excellence [Workflow Orchestration]
3568 Drive hyper-personalized customer experiences with Amazon Personalize and generative AI [Model and Prompt Chaining]
3573 Automating product description generation with Amazon Bedrock [Platforms/Tools/Studios]
3599 Building an AI Assistant for Smart Manufacturing with AWS IoT TwinMaker and Amazon Bedrock [AI Agent] [Workflow Orchestration]
3614 Nebraska Judicial Branch modernizes its Electronic Exhibits System using AWS [AI Agent]
3870 How Datalex enhances developer experience using Amazon Bedrock [Platforms/Tools/Studios] [Model and Prompt Chaining]
3875 How Crayon Uses AWS Language Technologies to Build Intelligent Decision Support Systems [Platforms/Tools/Studios] [Model and Prompt Chaining]
3941 Vertex Pharmaceuticals: Accelerating image segmentation for drug discovery imaging using serverless technologies on AWS [AI Agent]
3982 Deploy a Microsoft Teams gateway for Amazon Q Business | AWS ... [AI Agent] [Platforms/Tools/Studios]
4019 How Mendix is transforming customer experiences with generative AI and Amazon Bedrock [AI Agent]
4069 How Shellkode Uses Amazon Bedrock to Convert Natural Language Queries to NoSQL Statements [Model and Prompt Chaining]
4072 Integrate QnABot on AWS with ServiceNow | AWS Machine ... [Model and Prompt Chaining]
4200 A light in the dark—illuminating dark data with the OSDU Data Platform [AI Agent]
4227 Improving staff productivity at Enel using Amazon Bedrock | AWS for ... [Workflow Orchestration] [Platforms/Tools/Studios]
4246 Leveraging generative AI to simplify the network and service lifecycle management with Amdocs Intelligent OSS on AWS [Workflow Orchestration]
4288 Unlock personalized experiences powered by AI using Amazon Personalize and Amazon OpenSearch Service [Model and Prompt Chaining]
4313 Alida gains deeper understanding of customer feedback with Amazon Bedrock [Platforms/Tools/Studios]
4332 Automate the process to change image backgrounds using Amazon Bedrock and AWS Step Functions [Workflow Orchestration] [Model and Prompt Chaining]
4355 How Accenture's CCE Solution Powered by AWS Generative AI ... [Model and Prompt Chaining]
4359 Moderate audio and text chats using AWS AI services and LLMs [Workflow Orchestration]
4384 BMW Group Develops a GenAI Assistant to Accelerate Infrastructure Optimization on AWS [AI Agent] [Workflow Orchestration]
4470 AWS for Games debuts Guide to Generative AI for Game Developers, and more at GDC 2024 [AI Agent]
4515 Provide live agent assistance for your chatbot users with Amazon Lex and Talkdesk cloud contact center [Workflow Orchestration]
4517 Center for BrainHealth teams up with AWS to grow Charisma program using generative AI and cloud gaming [System Architecture]
4676 Let's Architect! Discovering Generative AI on AWS | AWS ... [AI Agent]
4677 Guardrails for Amazon Bedrock now available with new safety filters and privacy controls [Guardrails]
4678 Significant new capabilities make it easier to use Amazon Bedrock to build and scale generative AI applications – and achieve impressive results [Platforms/Tools/Studios]
4681 Enhance conversational AI with advanced routing techniques with Amazon Bedrock [Workflow Orchestration]
4683 Relive the Innovation: AWS Next Level Demo Recordings from MWC24 [AI Agent]
4700 Building scalable, secure, and reliable RAG applications using Knowledge Bases for Amazon Bedrock [Workflow Orchestration]
4788 Improving inclusion and accessibility through automated document translation with an open source app using Amazon Translate [Platforms/Tools/Studios] [Workflow Orchestration]
4881 Building Generative AI prompt chaining workflows with human in the loop [Model and Prompt Chaining]
4936 Executive Conversations: Putting generative AI to work in omnichannel customer service with Prashant Singh, Chief Operating Officer at LeadSquared [AI Agent]
4966 HCL Workload Automation expands AWS integration with AWS Step Functions [AI Agent]
4967 Build a decentralized semantic search engine on heterogeneous data stores using autonomous agents [Workflow Orchestration]
5104 Empowering predictive maintenance with Amazon Bedrock | AWS ... [Model and Prompt Chaining]
5126 Accelerate deep learning training and simplify orchestration with AWS Trainium and AWS Batch [Workflow Orchestration]
5184 Build an automated insight extraction framework for customer feedback analysis with Amazon Bedrock and Amazon QuickSight [Workflow Orchestration] [Platforms/Tools/Studios]
5239 Integrating Amazon Bedrock in your .NET applications | .NET on ... [Platforms/Tools/Studios]
5240 Build a self-service digital assistant using Amazon Lex and Knowledge Bases for Amazon Bedrock [AI Agent]
5254 Accenture creates a custom memory-persistent conversational user experience using Amazon Q Business [Workflow Orchestration]
5267 Improve your productivity with Amazon Q and Bedrock for SAP use cases [Platforms/Tools/Studios] [Model and Prompt Chaining] [Workflow Orchestration]
5318 Build enterprise-grade applications with natural language using AWS App Studio (preview) [Workflow Orchestration]
5323 Introducing Amazon Q Developer in SageMaker Studio to streamline ML workflows [Platforms/Tools/Studios]
5375 Video auto-dubbing using Amazon Translate, Amazon Bedrock, and Amazon Polly [Workflow Orchestration]
5378 Workday and AWS Deliver Enhanced AI Capabilities | AWS Partner ... [Platforms/Tools/Studios]
5415 Secure AccountantAI Chatbot: Lili's journey with Amazon Bedrock ... [Guardrails]
5607 Cepsa Química improves the efficiency and accuracy of product stewardship using Amazon Bedrock [Workflow Orchestration]
5620 Implementing Identity-Aware Sessions with Amazon Q Developer [Platforms/Tools/Studios]
5622 How bpx energy uses Amazon Bedrock to transform oil and gas production insights [Model and Prompt Chaining]
5723 Scaling multimodal understanding to long videos [Model and Prompt Chaining]
5850 AlloyDB and CloudSQL for PostgreSQL on LangChain on Vertex AI ... [AI Agent] [Workflow Orchestration] [Model and Prompt Chaining]
6073 Driving inclusive AI innovation with Azure AI Studio - Microsoft ... [Platforms/Tools/Studios]
6094 Building AI Agent Applications Series - Understanding AI Agents [AI Agent] [Workflow Orchestration]
6149 Creating a RAG Application with Azure Static Web Apps and App ... [Platforms/Tools/Studios] [Model and Prompt Chaining] [Workflow Orchestration]
6150 Build your own AI Text-to-Image Generator in Visual Studio Code [Platforms/Tools/Studios]
6158 Decoding AI: Part 3, Making data speak human - Azure Government [Workflow Orchestration]
6276 Demystifying Joule - SAP's Generative AI Copilot - SAP Community [AI Agent] [Platforms/Tools/Studios]
6305 SACGPT, an AI-Driven Intelligent Analytics Application - SAP Community [Model and Prompt Chaining]
6308 Top 10 takeaways #SAPTeched 2023 - SAP Community [AI Agent] [Platforms/Tools/Studios]
6311 SAP TechEd 2023 Through My Lens: SAP BTP reference architectures, use cases, collaboration, and fun-filled evenings with colleagues - SAP ... [Model and Prompt Chaining] [Workflow Orchestration]
6355 SAP Hackathon: A Showcase of AI Innovation at The Circle, Zurich [AI Agent]
6441 Revolutionizing Business through the Power of SAP'... - SAP ... [Platforms/Tools/Studios] [AI Agent]
6548 Next time "Just Ask": Simplifying Data Exploration - Configuration using a standard ABAP CDS View [Platforms/Tools/Studios]
6570 SAP Build Apps integration with SAP AI Core services: Part 1 - Setup - SAP ... [Platforms/Tools/Studios]
6609 SAP x AI/ML Series: ISLM Embedded Scenario with APL - SAP ... [Workflow Orchestration]
6620 Develop XR With Oracle, Ep. 6: Summarizer + Generator using Database, Vision AI, Cohere, Hugging Face, and Open AI [Model and Prompt Chaining] [AI Agent]
6634 Autonomous Database Select AI: Accelerate innovation with enterprise data, OCI Generative AI, and enhanced security [Model and Prompt Chaining]
6646 MyOracle Search powered by Generative AI [Workflow Orchestration]
6648 An AI application that can chat with any service [Workflow Orchestration]
6930 Design Custom Actions for Copilot with These 5 Tips | Salesforce [AI Agent]
7023 Compute Nest: Enabling Cutting-Edge Generative AI Integration and Knowledge [Workflow Orchestration]
7025 Smart Talk: Empowering Conversations with LLM Langchain AI Chatbots [AI Agent]
7034 Unlock the Power of Generative AI with Alibaba Cloud Model Studio [Model and Prompt Chaining]
7046 Alibaba's Dingtalk Launches AI Agent Marketplace, Upgrades AI ... [AI Agent]
7055 DormChecker: Enhancing Living Conditions in Dormitories with AI Technology [Model and Prompt Chaining]
7090 Using LLMs on Compute Nest for a RAG Service with PAI-EAS and AnalyticDB for PostgreSQL ... [Workflow Orchestration] [Model and Prompt Chaining]