@@ -0,0 +1,306 @@
+{
+ "cells": [
+ {
+ "cell_type": "markdown",
+ "id": "30eb1704-8d76-4bc9-9308-93243aeb69cb",
+ "metadata": {},
+ "source": [
+ "## This demo app shows:\n",
|
|
|
+ "* How to use LlamaIndex, an open source library to help you build custom data augmented LLM applications\n",
|
|
|
+ "* How to ask Llama questions about recent live data via the You.com live search API and LlamaIndex\n",
|
|
|
+ "\n",
|
|
|
+ "The LangChain package is used to facilitate the call to Llama2 hosted on Replicate\n",
|
|
|
+ "\n",
|
|
|
+ "**Note** We will be using Replicate to run the examples here. You will need to first sign in with Replicate with your github account, then create a free API token [here](https://replicate.com/account/api-tokens) that you can use for a while. \n",
|
|
|
+ "After the free trial ends, you will need to enter billing info to continue to use Llama2 hosted on Replicate."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "68cf076e",
+ "metadata": {},
+ "source": [
+ "We start by installing the necessary packages:\n",
|
|
|
+ "- [langchain](https://python.langchain.com/docs/get_started/introduction) which provides RAG capabilities\n",
|
|
|
+ "- [llama-index](https://docs.llamaindex.ai/en/stable/) for data augmentation."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "1d0005d6-e928-4d1a-981b-534a40e19e56",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "!pip install llama-index langchain"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "21fe3849",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "# use ServiceContext to configure the LLM used and the custom embeddings \n",
|
|
|
+ "from llama_index import ServiceContext\n",
|
|
|
+ "\n",
|
|
|
+ "# VectorStoreIndex is used to index custom data \n",
|
|
|
+ "from llama_index import VectorStoreIndex\n",
|
|
|
+ "\n",
|
|
|
+ "from langchain.llms import Replicate"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "73e8e661",
+ "metadata": {},
+ "source": [
+ "Next we set up the Replicate token."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "d9d76e33",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "from getpass import getpass\n",
|
|
|
+ "import os\n",
|
|
|
+ "\n",
|
|
|
+ "REPLICATE_API_TOKEN = getpass()\n",
|
|
|
+ "os.environ[\"REPLICATE_API_TOKEN\"] = REPLICATE_API_TOKEN"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "f8ff812b",
+ "metadata": {},
+ "source": [
+ "In this example we will use the [YOU.com](https://you.com/) search engine to augment the LLM's responses.\n",
|
|
|
+ "To use the You.com Search API, you can email api@you.com to request an API key. "
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "75275628-5235-4b55-8033-601c76107528",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "\n",
|
|
|
+ "YOUCOM_API_KEY = getpass()\n",
|
|
|
+ "os.environ[\"YOUCOM_API_KEY\"] = YOUCOM_API_KEY"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "cb210c7c",
+ "metadata": {},
+ "source": [
+ "We then call the Llama 2 model from replicate. \n",
|
|
|
+ "\n",
|
|
|
+ "We will use the llama 2 13b chat model. You can find more Llama 2 models by searching for them on the [Replicate model explore page](https://replicate.com/explore?query=llama).\n",
|
|
|
+ "You can add them here in the format: model_name/version"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "c12fc2cb",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "# set llm to be using Llama2 hosted on Replicate\n",
|
|
|
+ "llama2_13b_chat = \"meta/llama-2-13b-chat:f4e2de70d66816a838a89eeeb621910adffb0dd0baba3976c96980970978018d\"\n",
|
|
|
+ "\n",
|
|
|
+ "llm = Replicate(\n",
|
|
|
+ " model=llama2_13b_chat,\n",
|
|
|
+ " model_kwargs={\"temperature\": 0.01, \"top_p\": 1, \"max_new_tokens\":500}\n",
|
|
|
+ ")"
|
|
|
+ ]
|
|
|
+ },
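+ {
+ "cell_type": "markdown",
+ "id": "3f2a9c1e-0b7d-4c5a-9e8f-5a1b2c3d4e01",
+ "metadata": {},
+ "source": [
+ "Before wiring the model into an index, we can sanity-check the Replicate endpoint with a direct call. This is a minimal sketch; the prompt is just an example."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "4a3b0d2f-1c8e-4d6b-8f9a-6b2c3d4e5f02",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "# quick sanity check that the Replicate-hosted model responds\n",
+ "print(llm(\"In one sentence, what is Meta Connect?\"))"
+ ]
+ },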
+ {
+ "cell_type": "markdown",
+ "id": "476d72da",
+ "metadata": {},
+ "source": [
+ "Using our api key we set up earlier, we make a request from YOU.com for live data on a particular topic."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "effc9656-b18d-4d24-a80b-6066564a838b",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "\n",
|
|
|
+ "import requests\n",
|
|
|
+ "\n",
|
|
|
+ "query = \"Meta Connect\" # you can try other live data query about sports score, stock market and weather info \n",
|
|
|
+ "headers = {\"X-API-Key\": os.environ[\"YOUCOM_API_KEY\"]}\n",
|
|
|
+ "data = requests.get(\n",
|
|
|
+ " f\"https://api.ydc-index.io/search?query={query}\",\n",
|
|
|
+ " headers=headers,\n",
|
|
|
+ ").json()"
|
|
|
+ ]
|
|
|
+ },
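+ {
+ "cell_type": "markdown",
+ "id": "5b4c1e3a-2d9f-4e7c-9a0b-7c3d4e5f6a03",
+ "metadata": {},
+ "source": [
+ "The call above assumes a successful response. A slightly more defensive version of the same request (a sketch) surfaces HTTP errors, such as a 401 from a bad key, before parsing the JSON."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "6c5d2f4b-3e0a-4f8d-8b1c-8d4e5f6a7b04",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "resp = requests.get(\n",
+ "    f\"https://api.ydc-index.io/search?query={query}\",\n",
+ "    headers=headers,\n",
+ ")\n",
+ "# raise early on HTTP errors (e.g. 401 for a bad or missing API key)\n",
+ "resp.raise_for_status()\n",
+ "data = resp.json()"
+ ]
+ },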
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "8bed3baf-742e-473c-ada1-4459012a8a2c",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "# check the query result in JSON\n",
|
|
|
+ "import json\n",
|
|
|
+ "\n",
|
|
|
+ "print(json.dumps(data, indent=2))"
|
|
|
+ ]
|
|
|
+ },
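+ {
+ "cell_type": "markdown",
+ "id": "7d6e3a5c-4f1b-4a9e-9c2d-9e5f6a7b8c05",
+ "metadata": {},
+ "source": [
+ "The fields we care about are the `snippets` lists inside each item of `data[\"hits\"]`. A quick peek at the shape, assuming the query returned at least one hit with at least one snippet:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "8e7f4b6d-5a2c-4b0f-8d3e-0f6a7b8c9d06",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "# how many hits came back, and a sample snippet from the first one\n",
+ "print(len(data[\"hits\"]))\n",
+ "print(data[\"hits\"][0][\"snippets\"][0])"
+ ]
+ },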
+ {
+ "cell_type": "markdown",
+ "id": "b196e697",
+ "metadata": {},
+ "source": [
+ "We then use the [`JSONLoader`](https://llamahub.ai/l/file-json) to extract the text from the returned data. The `JSONLoader` gives us the ability to load the data into LamaIndex.\n",
|
|
|
+ "In the next cell we show how to load the JSON result with key info stored as \"snippets\".\n",
|
|
|
+ "\n",
|
|
|
+ "However, you can also add the snippets in the query result to documents like below:\n",
|
|
|
+ "```python \n",
|
|
|
+ "from llama_index import Document\n",
|
|
|
+ "snippets = [snippet for hit in data[\"hits\"] for snippet in hit[\"snippets\"]]\n",
|
|
|
+ "documents = [Document(text=s) for s in snippets]\n",
|
|
|
+ "```\n",
|
|
|
+ "This can be handy if you just need to add a list of text strings to doc"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "7c40e73f-ca13-4f4a-a753-e613df3d389e",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "# one way to load the JSON result with key info stored as \"snippets\"\n",
|
|
|
+ "from llama_index import download_loader\n",
|
|
|
+ "\n",
|
|
|
+ "JsonDataReader = download_loader(\"JsonDataReader\")\n",
|
|
|
+ "loader = JsonDataReader()\n",
|
|
|
+ "documents = loader.load_data([hit[\"snippets\"] for hit in data[\"hits\"]])\n"
|
|
|
+ ]
|
|
|
+ },
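+ {
+ "cell_type": "markdown",
+ "id": "9f8a5c7e-6b3d-4c1a-9e4f-1a7b8c9d0e07",
+ "metadata": {},
+ "source": [
+ "A quick look at what the loader produced, assuming at least one document was created:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "0a9b6d8f-7c4e-4d2b-8f5a-2b8c9d0e1f08",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "# inspect the documents created from the search snippets\n",
+ "print(f\"number of documents: {len(documents)}\")\n",
+ "print(documents[0].text[:200])"
+ ]
+ },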
+ {
+ "cell_type": "markdown",
+ "id": "8e5e3b4e",
+ "metadata": {},
+ "source": [
+ "With the data set up, we create a vector store for the data and a query engine for it.\n",
|
|
|
+ "\n",
|
|
|
+ "For our embeddings we will use `HuggingFaceEmbeddings` whose default embedding model is sentence-transformers/all-mpnet-base-v2. This model provides a good balance between speed and performance.\n",
|
|
|
+ "To change the default model, call `HuggingFaceEmbeddings(model_name=<another_embedding_model>)`. \n",
|
|
|
+ "\n",
|
|
|
+ "For more info see https://huggingface.co/blog/mteb. "
+ ]
+ },
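+ {
+ "cell_type": "markdown",
+ "id": "1b0c7e9a-8d5f-4e3c-9a6b-3c9d0e1f2a09",
+ "metadata": {},
+ "source": [
+ "For example, to swap in a different embedding model before building the index (a sketch; `all-MiniLM-L6-v2` is just one illustrative smaller and faster choice):"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "2c1d8f0b-9e6a-4f4d-8b7c-4d0e1f2a3b10",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "from langchain.embeddings.huggingface import HuggingFaceEmbeddings\n",
+ "from llama_index import LangchainEmbedding\n",
+ "\n",
+ "# illustrative alternative: a smaller, faster sentence-transformers model\n",
+ "alt_embeddings = LangchainEmbedding(\n",
+ "    HuggingFaceEmbeddings(model_name=\"sentence-transformers/all-MiniLM-L6-v2\")\n",
+ ")"
+ ]
+ },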
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "a5de3080-2c4b-479c-baba-793b3bee36ed",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "# use HuggingFace embeddings \n",
|
|
|
+ "from langchain.embeddings.huggingface import HuggingFaceEmbeddings\n",
|
|
|
+ "from llama_index import LangchainEmbedding\n",
|
|
|
+ "\n",
|
|
|
+ "\n",
|
|
|
+ "embeddings = LangchainEmbedding(HuggingFaceEmbeddings())\n",
|
|
|
+ "print(embeddings)\n",
|
|
|
+ "\n",
|
|
|
+ "# create a ServiceContext instance to use Llama2 and custom embeddings\n",
|
|
|
+ "service_context = ServiceContext.from_defaults(llm=llm, chunk_size=800, chunk_overlap=20, embed_model=embeddings)\n",
|
|
|
+ "\n",
|
|
|
+ "# create vector store index from the documents created above\n",
|
|
|
+ "index = VectorStoreIndex.from_documents(documents, service_context=service_context)\n",
|
|
|
+ "\n",
|
|
|
+ "# create query engine from the index\n",
|
|
|
+ "query_engine = index.as_query_engine(streaming=True)"
|
|
|
+ ]
|
|
|
+ },
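+ {
+ "cell_type": "markdown",
+ "id": "3d2e9a1c-0f7b-4a5e-9c8d-5e1f2a3b4c11",
+ "metadata": {},
+ "source": [
+ "Optionally, you can persist the index to disk so the snippets don't have to be re-embedded next time. A sketch using LlamaIndex's storage context; the `./storage` path is an arbitrary choice:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "4e3f0b2d-1a8c-4b6f-8d9e-6f2a3b4c5d12",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "from llama_index import StorageContext, load_index_from_storage\n",
+ "\n",
+ "# save the index to disk\n",
+ "index.storage_context.persist(persist_dir=\"./storage\")\n",
+ "\n",
+ "# ...and reload it later with the same LLM and embedding configuration\n",
+ "storage_context = StorageContext.from_defaults(persist_dir=\"./storage\")\n",
+ "index = load_index_from_storage(storage_context, service_context=service_context)"
+ ]
+ },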
+ {
+ "cell_type": "markdown",
+ "id": "2c4ea012",
+ "metadata": {},
+ "source": [
+ "We are now ready to ask Llama 2 a question about the live data using our query engine."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "de91a191-d0f2-498e-88dc-b2b43423e0e5",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "# ask Llama2 a summary question about the search result\n",
|
|
|
+ "response = query_engine.query(\"give me a summary\")\n",
|
|
|
+ "response.print_response_stream()"
|
|
|
+ ]
|
|
|
+ },
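+ {
+ "cell_type": "markdown",
+ "id": "5f4a1c3e-2b9d-4c7a-9e0f-7a3b4c5d6e13",
+ "metadata": {},
+ "source": [
+ "To see which retrieved snippets grounded the answer, you can inspect the response's source nodes. A sketch using the pre-0.10 LlamaIndex node API:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "6a5b2d4f-3c0e-4d8b-8f1a-8b4c5d6e7f14",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "# show the retrieval score and a preview of each source snippet\n",
+ "for node_with_score in response.source_nodes:\n",
+ "    print(node_with_score.score, \"-\", node_with_score.node.get_text()[:150])"
+ ]
+ },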
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "72814b20-06aa-4da8-b4dd-f0b0d74a2ea0",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "# more questions\n",
+ "query_engine.query(\"what products were announced\").print_response_stream()"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "a65bc037-a689-476d-b529-0059a27bc949",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "query_engine.query(\"tell me more about Meta AI assistant\").print_response_stream()"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "16a56542",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "query_engine.query(\"what are Generative AI stickers\").print_response_stream()"
+ ]
+ }
+ ],
+ "metadata": {
+ "kernelspec": {
+ "display_name": "Python 3 (ipykernel)",
+ "language": "python",
+ "name": "python3"
+ },
+ "language_info": {
+ "codemirror_mode": {
+ "name": "ipython",
+ "version": 3
+ },
+ "file_extension": ".py",
+ "mimetype": "text/x-python",
+ "name": "python",
+ "nbconvert_exporter": "python",
+ "pygments_lexer": "ipython3",
+ "version": "3.8.18"
+ }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}