LLaMa2 GPTQ

A question-answering AI that provides answers with source documents, based on Texonom.

  • Built with Langchain
  • Chat UI built with Streamlit web components
  • Rye-based Python package management
$ git clone https://github.com/seonglae/llama2gptq
$ cd llama2gptq
$ git clone https://huggingface.co/datasets/texonom/llama2gptq db
$ rye sync
$ streamlit run chat.py

What's included

Langchain

Langchain makes it easy to engineer prompts that reference source documents.
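
To illustrate the idea of a prompt that references source documents, here is a minimal sketch in plain Python. It does not use Langchain's API and is not taken from this repository: `SourceDoc`, `TEMPLATE`, and `build_prompt` are hypothetical names, and the real prompt wording may differ.

```python
from dataclasses import dataclass


@dataclass
class SourceDoc:
    """A retrieved passage and its title (hypothetical structure)."""
    title: str
    text: str


# Hypothetical template; the repository's actual prompt may differ.
TEMPLATE = (
    "Answer the question using only the numbered sources below.\n"
    "Cite sources by number, e.g. [1].\n\n"
    "{context}\n\n"
    "Question: {question}\n"
    "Answer:"
)


def build_prompt(question: str, docs: list[SourceDoc]) -> str:
    """Number each source and insert it into the template."""
    context = "\n".join(
        f"[{i}] {doc.title}: {doc.text}" for i, doc in enumerate(docs, start=1)
    )
    return TEMPLATE.format(context=context, question=question)


if __name__ == "__main__":
    docs = [
        SourceDoc("GPTQ", "GPTQ quantizes model weights to low-bit integers."),
        SourceDoc("ChromaDB", "ChromaDB stores embeddings for similarity search."),
    ]
    print(build_prompt("What does GPTQ do?", docs))
```

Numbering the sources in the prompt lets the model's answer point back to specific documents, which is what makes the references checkable.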

GPTQ

CUDA-based int4 model quantization lets the model run in a local environment.
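
To give a feel for what int4 storage means, the sketch below uses plain round-to-nearest quantization with a single shared scale. This is a deliberate simplification and not the GPTQ algorithm, which additionally adjusts the remaining weights to compensate for quantization error and runs on the GPU. All function names here are hypothetical.

```python
def quantize_int4(weights: list[float]) -> tuple[list[int], float]:
    """Map floats to integers in [-8, 7] using one shared scale."""
    max_abs = max((abs(w) for w in weights), default=0.0)
    scale = max_abs / 7 if max_abs > 0 else 1.0
    q = [max(-8, min(7, round(w / scale))) for w in weights]
    return q, scale


def pack_int4(q: list[int]) -> bytes:
    """Store two 4-bit values per byte, low nibble first."""
    nibbles = [v & 0xF for v in q]  # two's-complement 4-bit encoding
    if len(nibbles) % 2:
        nibbles.append(0)  # pad to an even number of nibbles
    return bytes(lo | (hi << 4) for lo, hi in zip(nibbles[0::2], nibbles[1::2]))


def unpack_int4(data: bytes, count: int) -> list[int]:
    """Recover `count` signed 4-bit values from packed bytes."""
    out = []
    for byte in data:
        for nib in (byte & 0xF, byte >> 4):
            out.append(nib - 16 if nib >= 8 else nib)
    return out[:count]


def dequantize(q: list[int], scale: float) -> list[float]:
    """Approximate the original floats from the stored integers."""
    return [v * scale for v in q]


if __name__ == "__main__":
    weights = [0.7, -0.7, 0.0, 0.2]
    q, scale = quantize_int4(weights)
    packed = pack_int4(q)
    print(len(packed), "bytes for", len(weights), "weights")
    print(dequantize(unpack_int4(packed, len(q)), scale))
```

Two weights fit in each byte, so the packed weights take roughly a quarter of the space of 16-bit floats (before counting the scales), which is the reason a quantized model can fit on local hardware.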

ChromaDB

The vector database lets LLaMa2 GPTQ provide responses with reference documents.
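
The retrieval step can be sketched with a toy in-memory store. This is an illustration only and does not use ChromaDB's API: `TinyVectorStore`, `embed`, and `cosine` are hypothetical, and the word-count "embedding" stands in for the neural embedding model a real setup would use.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding: lowercase word counts (assumption, not a real model)."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    norm = norm_a * norm_b
    return dot / norm if norm else 0.0


class TinyVectorStore:
    """Hypothetical in-memory stand-in for a vector database."""

    def __init__(self) -> None:
        self._rows = []  # each row is (source, text, vector)

    def add(self, source: str, text: str) -> None:
        self._rows.append((source, text, embed(text)))

    def query(self, question: str, k: int = 2) -> list:
        """Return the k stored passages most similar to the question."""
        qv = embed(question)
        ranked = sorted(self._rows, key=lambda r: cosine(qv, r[2]), reverse=True)
        return [(source, text) for source, text, _ in ranked[:k]]


if __name__ == "__main__":
    store = TinyVectorStore()
    store.add("gptq.md", "gptq quantizes weights to int4")
    store.add("chroma.md", "chroma stores embeddings for search")
    store.add("ui.md", "streamlit renders the chat ui")
    for source, text in store.query("what does gptq do to weights", k=1):
        print(source, "->", text)
```

Because each stored passage keeps its source name, the passages returned for a question can be shown alongside the model's answer as references.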

Web UI

A chat UI is provided for conversing with a private AI without any external API.