
Support for OpenAI cache #6

Open · wants to merge 2 commits into main

@bixia commented Dec 24, 2024

Improve OpenAI API Response Caching System

Overview

This PR enhances the caching system for OpenAI API calls to improve performance and reduce API costs. The changes provide consistent caching behavior across different API endpoints (OpenAI, Azure, Claude) and request types (chat completions, embeddings).

Key Changes

Cache Implementation

  • Unified caching approach for all API endpoints (OpenAI, Azure, Claude)
  • Consistent cache key generation using request data
  • JSON serialization for embedding responses
  • Zero-vector fallback for embedding errors
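The consistent cache-key generation described above could look roughly like the sketch below. This is a minimal illustration, not the PR's actual code: the function name `make_cache_key` and the choice to hash the endpoint name together with the JSON-serialized request payload are assumptions.

```python
import hashlib
import json

def make_cache_key(endpoint: str, request_data: dict) -> str:
    """Derive a deterministic cache key from the endpoint and request payload.

    sort_keys=True makes logically identical requests hash to the same key
    regardless of dict insertion order.
    """
    payload = json.dumps(request_data, sort_keys=True, ensure_ascii=False)
    return hashlib.sha256(f"{endpoint}:{payload}".encode("utf-8")).hexdigest()
```

With a scheme like this, the same prompt sent to OpenAI and to Azure produces different keys, while two requests that differ only in dict ordering hit the same cache entry.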

Supported Endpoints

  • OpenAI chat completions
  • Azure OpenAI completions
  • Claude chat completions
  • OpenAI embeddings
  • Custom embeddings service

Error Handling

  • Improved error handling with fallback responses
  • Zero-vector returns for embedding failures
  • Cache miss handling with proper error messages
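The zero-vector fallback for embedding failures might be sketched as follows. Both the wrapper name `safe_get_embedding` and the fixed dimension of 1536 are illustrative assumptions; the real dimension depends on the embedding model in use.

```python
EMBEDDING_DIM = 1536  # assumed dimension; the real value depends on the model

def safe_get_embedding(text: str, fetch_embedding) -> list:
    """Return the embedding for `text`, or a zero vector if the API call fails.

    `fetch_embedding` stands in for whatever client call the caching layer wraps.
    """
    try:
        return fetch_embedding(text)
    except Exception:
        # Fall back to a zero vector so downstream code never sees None.
        return [0.0] * EMBEDDING_DIM
```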

Benefits

  • Reduced API costs through efficient response caching
  • Improved performance for repeated queries
  • Consistent caching behavior across all endpoints
  • Better error recovery and fallback mechanisms

Testing

The changes have been tested with:

  • Standard OpenAI endpoints
  • Azure OpenAI endpoints
  • Claude API endpoints
  • Custom embedding services
  • Error scenarios and fallbacks

Usage Example

```python
# The cache is automatically used for all API calls
response = common_ask(prompt)  # Will use cache if available

# Embeddings are also cached
embedding = common_get_embedding(text)  # Cached with proper JSON serialization
```

Notes

  • No database schema changes required
  • Backwards compatible with existing cache entries
  • Thread-safe implementation
  • Proper JSON serialization for embedding vectors
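The thread-safety note above presumably amounts to serializing cache reads and writes behind a lock; a minimal sketch (the class name and its get/set API are assumptions, not the PR's code):

```python
import threading

class ThreadSafeCache:
    """A dict-backed cache whose get/set operations are guarded by a lock."""

    def __init__(self):
        self._lock = threading.Lock()
        self._store = {}

    def get(self, key, default=None):
        with self._lock:
            return self._store.get(key, default)

    def set(self, key, value):
        with self._lock:
            self._store[key] = value
```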

Related Issues

  • Reduces API costs through caching
  • Improves response times for repeated queries
  • Provides consistent behavior across different API endpoints

Future Improvements

  • Add cache expiration policies
  • Implement cache size limits
  • Add cache statistics tracking
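A cache-expiration policy like the one proposed could be sketched as a TTL check at read time. This is illustrative only; the class name and storage layout are assumptions.

```python
import time

class TTLCache:
    """Cache whose entries expire `ttl` seconds after they are written."""

    def __init__(self, ttl: float):
        self.ttl = ttl
        self._store = {}  # key -> (value, timestamp)

    def get(self, key, default=None):
        entry = self._store.get(key)
        if entry is None:
            return default
        value, ts = entry
        if time.monotonic() - ts > self.ttl:
            # Entry expired: drop it and report a miss.
            del self._store[key]
            return default
        return value

    def set(self, key, value):
        self._store[key] = (value, time.monotonic())
```

Checking expiry lazily at read time keeps writes cheap; a size limit or statistics counter could hook into the same get/set paths.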
