This is just a small project for lab purposes. NOT for production.
- Ubuntu 22.04 LTS (Desktop/Cloud Server)
- NVIDIA GPU: RTX 3070 Ti or RTX 4090 (tested with both)
- Your server has the NVIDIA driver installed (you can install it with `sudo ubuntu-drivers autoinstall` and verify with `nvidia-smi`). I am using the latest driver version:

  ```bash
  stackops@cuongdm3-voldemort:~$ nvidia-smi --version
  NVIDIA-SMI version  : 550.120
  NVML version        : 550.120
  DRIVER version      : 550.120
  CUDA Version        : 12.4
  ```
- Integrated with Triton Inference Server.
- Deploy it on Kubernetes.
- Streaming messages.
- Multi-session support.
- The Docker image is located at `vcr.vngcloud.vn/60108-cuongdm3/chatgpt-like-clone:base`.
- ChatGPT-like clone is a web-chat application built with Gradio and Haystack.
- Currently, I am using the `llama3.1:8b` model as the default.
- The model runs in an Ollama Docker container with GPU support.
- So far, this application supports running both locally and on Kubernetes. Depending on your environment, choose the deployment method that suits you.
- To run it locally:

  1. Enable GPU in Docker (see https://github.com/ollama/ollama/blob/main/docs/docker.md):

     ```bash
     curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey \
       | sudo gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg
     curl -s -L https://nvidia.github.io/libnvidia-container/stable/deb/nvidia-container-toolkit.list \
       | sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' \
       | sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list
     sudo apt-get update
     sudo apt-get install -y nvidia-container-toolkit
     sudo nvidia-ctk runtime configure --runtime=docker
     sudo systemctl restart docker
     ```
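     To confirm the runtime is wired up, you can launch a throwaway container and check that it sees the GPU (a quick sanity check along the lines of the NVIDIA Container Toolkit docs):

     ```bash
     # should print the same GPU table as nvidia-smi on the host
     sudo docker run --rm --gpus all ubuntu nvidia-smi
     ```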
  2. Run Ollama in Docker with GPU support:

     ```bash
     docker run -d --gpus=all -v ollama:/root/.ollama -p 11434:11434 --name ollama ollama/ollama
     docker exec ollama ollama pull llama3.1:8b
     ```
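     Before moving on, you can verify that Ollama answers over HTTP. This hits Ollama's documented `/api/generate` endpoint; by default it streams the reply as NDJSON chunks (the behavior the app's streaming messages build on), and `"stream": false` collapses it into a single JSON response:

     ```bash
     # quick sanity check against the Ollama API
     curl http://localhost:11434/api/generate \
       -d '{"model": "llama3.1:8b", "prompt": "Say hello", "stream": false}'
     ```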
  3. Run the Gradio ChatApp (see the note after these steps if your Ollama server is not on localhost):

     ```bash
     python3 main.py
     ```
  4. Visit http://localhost:7860 to chat with the model.
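  A note on remote Ollama servers: judging from the `args` in app.yaml below, `main.py` appears to take the Ollama server address as its first positional argument. Assuming that reading is correct, you can point the app at a remote instance like so (the address is a placeholder):

  ```bash
  # hypothetical invocation inferred from app.yaml's args
  python3 main.py http://<ollama-host>:11434
  ```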
- If you intend to deploy this application on Kubernetes, apply the manifest app.yaml, then access the application via the public IP of a worker node on port 30007.

  ```yaml
  # File app.yaml
  apiVersion: apps/v1
  kind: Deployment
  metadata:
    name: chatgpt-clone-deployment
  spec:
    replicas: 1
    selector:
      matchLabels:
        app: chatgpt-clone
    template:
      metadata:
        labels:
          app: chatgpt-clone
      spec:
        containers:
          - name: chatgpt-clone
            image: vcr.vngcloud.vn/60108-cuongdm3/chatgpt-like-clone:base
            command: ["python"]
            args: ["main.py", "http://ollama33.ollama"]  # please change the Ollama server address
            ports:
              - containerPort: 7860
        nodeSelector:
          vks.vngcloud.vn/nodegroup: nodegroup-27051
  ---
  apiVersion: v1
  kind: Service
  metadata:
    name: chatgpt-clone-service
  spec:
    selector:
      app: chatgpt-clone
    type: NodePort
    ports:
      - protocol: TCP
        port: 80
        targetPort: 7860
        nodePort: 30007  # Specify a fixed NodePort (optional) or let Kubernetes choose
  ```
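  A typical apply-and-check sequence with standard kubectl (assuming your kubeconfig already points at the cluster):

  ```bash
  kubectl apply -f app.yaml
  kubectl get pods -l app=chatgpt-clone    # wait until the pod is Running
  kubectl get svc chatgpt-clone-service    # confirm the NodePort (30007)
  # then browse to http://<worker-node-public-ip>:30007
  ```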