# Llama Recipes: Examples to get started using the Llama models from Meta

The 'llama-recipes' repository is a companion to the Llama 2 model. The goal of this repository is to provide a scalable library for fine-tuning Llama 2, along with example scripts and notebooks to quickly get started with the Llama 2 models in a variety of use cases, including fine-tuning for domain adaptation and building LLM-based applications with Llama 2 and other tools in the LLM ecosystem. The examples here showcase how to run Llama 2 locally, in the cloud, and on-prem.
> [!IMPORTANT]
> Llama 3 has a new prompt template and special tokens (based on the tiktoken tokenizer).

| Token | Description |
|---|---|
| `<\|begin_of_text\|>` | This is equivalent to the BOS token. |
| `<\|eot_id\|>` | This signifies the end of the message in a turn. This is equivalent to the EOS token. |
| `<\|start_header_id\|>{role}<\|end_header_id\|>` | These tokens enclose the role for a particular message. The possible roles can be: system, user, assistant. |

A multiturn conversation with Llama 3 follows this prompt template:

```
<|begin_of_text|><|start_header_id|>system<|end_header_id|>

{{ system_prompt }}<|eot_id|><|start_header_id|>user<|end_header_id|>

{{ user_message_1 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

{{ model_answer_1 }}<|eot_id|><|start_header_id|>user<|end_header_id|>

{{ user_message_2 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
```

More details on the new tokenizer and prompt template: <PLACEHOLDER_URL>
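To make the template concrete, here is a minimal Python sketch that assembles a multiturn prompt with plain string formatting. The helper name and message structure are assumptions for illustration; this is not a chat-formatting utility shipped by llama-recipes:

```python
# Minimal sketch: build a Llama 3 chat prompt with plain string formatting.
# Illustrative only; not a chat-formatting utility shipped by llama-recipes.

def format_llama3_prompt(messages):
    """messages: list of {"role": "system"|"user"|"assistant", "content": str}."""
    prompt = "<|begin_of_text|>"
    for message in messages:
        prompt += f"<|start_header_id|>{message['role']}<|end_header_id|>\n\n"
        prompt += f"{message['content']}<|eot_id|>"
    # Open an assistant header so the model generates the next turn.
    prompt += "<|start_header_id|>assistant<|end_header_id|>\n\n"
    return prompt


print(format_llama3_prompt([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "What is the capital of France?"},
]))
```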
> [!NOTE]
> The llama-recipes repository was recently refactored to promote a better developer experience of using the examples. Some files have been moved to new locations. The `src/` folder has NOT been modified, so the functionality of this repo and package is not impacted.
>
> Make sure you update your local clone by running `git pull origin main`

## Table of Contents

- [Llama Recipes: Examples to get started using the Llama models from Meta](#llama-recipes-examples-to-get-started-using-the-llama-models-from-meta)
  - [Table of Contents](#table-of-contents)
  - [Getting Started](#getting-started)
    - [Prerequisites](#prerequisites)
      - [PyTorch Nightlies](#pytorch-nightlies)
    - [Installing](#installing)
      - [Install with pip](#install-with-pip)
      - [Install with optional dependencies](#install-with-optional-dependencies)
      - [Install from source](#install-from-source)
    - [Getting the Llama models](#getting-the-llama-models)
      - [Model conversion to Hugging Face](#model-conversion-to-hugging-face)
  - [Repository Organization](#repository-organization)
    - [`recipes/`](#recipes)
    - [`src/`](#src)
  - [Contributing](#contributing)
  - [License](#license)

## Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes. See deployment for notes on how to deploy the project on a live system.

### Prerequisites

#### PyTorch Nightlies

Some features (especially fine-tuning with FSDP + PEFT) currently require PyTorch nightlies to be installed. Please make sure to install the nightlies if you're using these features following [this guide](https://pytorch.org/get-started/locally/).

### Installing

Llama-recipes provides a pip distribution for easy install and usage in other projects. Alternatively, it can be installed from source.

#### Install with pip

```bash
pip install --extra-index-url https://download.pytorch.org/whl/test/cu118 llama-recipes
```
#### Install with optional dependencies

Llama-recipes offers the installation of optional packages. There are three optional dependency groups.

To run the unit tests we can install the required dependencies with:
```bash
pip install --extra-index-url https://download.pytorch.org/whl/test/cu118 llama-recipes[tests]
```
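Once the test extras are installed, the unit tests can typically be run from a clone of the repository with `python -m pytest tests/` (this assumes pytest as the test runner).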
For the vLLM example we need additional requirements that can be installed with:
```bash
pip install --extra-index-url https://download.pytorch.org/whl/test/cu118 llama-recipes[vllm]
```
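With the vLLM extra installed, the example can be driven through vLLM's offline Python API. A minimal sketch; the model id below is an assumption (the Meta Llama checkpoints on Hugging Face are gated, so substitute one you have access to):

```python
# Minimal vLLM sketch: offline batch generation with a Llama checkpoint.
from vllm import LLM, SamplingParams

llm = LLM(model="meta-llama/Llama-2-7b-chat-hf")  # assumed model id (gated on HF)
params = SamplingParams(temperature=0.8, max_tokens=64)

outputs = llm.generate(["Explain what fine-tuning is in one sentence."], params)
print(outputs[0].outputs[0].text)
```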
To use the sensitive topics safety checker install with:
```bash
pip install --extra-index-url https://download.pytorch.org/whl/test/cu118 llama-recipes[auditnlg]
```
Optional dependencies can also be combined with [option1,option2].

#### Install from source

To install from source, e.g. for development, use these commands. We're using hatchling as our build backend, which requires an up-to-date pip as well as setuptools package.
```bash
git clone git@github.com:meta-llama/llama-recipes.git
cd llama-recipes
pip install -U pip setuptools
pip install --extra-index-url https://download.pytorch.org/whl/test/cu118 -e .
```
For development and contributing to llama-recipes, please install all optional dependencies:
```bash
git clone git@github.com:meta-llama/llama-recipes.git
cd llama-recipes
pip install -U pip setuptools
pip install --extra-index-url https://download.pytorch.org/whl/test/cu118 -e .[tests,auditnlg,vllm]
```
### Getting the Llama models

You can find Llama 2 models on the Hugging Face hub [here](https://huggingface.co/meta-llama), **where models with `hf` in the name are already converted to Hugging Face checkpoints so no further conversion is needed**. The conversion step below is only for original model weights from Meta that are hosted on the Hugging Face model hub as well.

#### Model conversion to Hugging Face

The recipes and notebooks in this folder use the Llama 2 model definition provided by Hugging Face's transformers library. Given that the original checkpoint resides under models/7B, you can install all requirements and convert the checkpoint with:

```bash
## Install Hugging Face Transformers from source
pip freeze | grep transformers ## verify it is version 4.31.0 or higher

git clone git@github.com:huggingface/transformers.git
cd transformers
pip install protobuf
python src/transformers/models/llama/convert_llama_weights_to_hf.py \
   --input_dir /path/to/downloaded/llama/weights --model_size 7B --output_dir /output/path
```
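After conversion, the checkpoint loads like any Hugging Face model. A minimal sketch, assuming `/output/path` is the directory written by the conversion script above:

```python
# Minimal sketch: load and sample from the converted checkpoint.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("/output/path")
model = AutoModelForCausalLM.from_pretrained("/output/path")

inputs = tokenizer("The capital of France is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```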
## Repository Organization

Most of the code dealing with Llama usage is organized across 2 main folders: `recipes/` and `src/`.

### `recipes/`

Contains examples organized in folders by topic:
| Subfolder | Description |
|---|---|
| quickstart | The "Hello World" of using Llama2, start here if you are new to using Llama2. |
| finetuning | Scripts to finetune Llama2 on single-GPU and multi-GPU setups (a launch example follows this table) |
| inference | Scripts to deploy Llama2 for inference locally and using model servers |
| use_cases | Scripts showing common applications of Llama2 |
| responsible_ai | Scripts to use PurpleLlama for safeguarding model outputs |
| llama_api_providers | Scripts to run inference on Llama via hosted endpoints |
| benchmarks | Scripts to benchmark Llama 2 models inference on various backends |
| code_llama | Scripts to run inference with the Code Llama models |
| evaluation | Scripts to evaluate fine-tuned Llama2 models using `lm-evaluation-harness` from EleutherAI |
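Most of the scripts in `finetuning/` build on a packaged entry point. Assuming llama-recipes is installed, single-GPU PEFT fine-tuning can typically be launched with `python -m llama_recipes.finetuning --use_peft --peft_method lora --quantization --model_name <path/to/model> --output_dir <path/to/save>`; the flags shown here are illustrative, so consult the finetuning recipes for the exact options.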
### `src/`

Contains modules which support the example recipes:
| Subfolder | Description |
|---|---|
| configs | Contains the configuration files for PEFT methods, FSDP, Datasets, Weights & Biases experiment tracking. |
| datasets | Contains individual scripts for each dataset to download and process. |
| inference | Includes modules for inference with the fine-tuned models. |
| model_checkpointing | Contains FSDP checkpoint handlers. |
| policies | Contains FSDP scripts to provide different policies, such as mixed precision, transformer wrapping policy and activation checkpointing along with any precision optimizer (used for running FSDP with pure bf16 mode). |
| utils | Utility files for:<br/>- `train_utils.py` provides training/eval loop and more train utils.<br/>- `dataset_utils.py` to get preprocessed datasets.<br/>- `config_utils.py` to override the configs received from CLI.<br/>- `fsdp_utils.py` provides FSDP wrapping policy for PEFT methods.<br/>- `memory_utils.py` context manager to track different memory stats in train loop. |
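As an illustration of the kind of helper `utils/` contains, below is a hypothetical sketch of a CUDA memory-tracking context manager in the spirit of `memory_utils.py`; it is not the implementation shipped in `src/`:

```python
# Hypothetical sketch of a memory-tracking context manager, in the spirit of
# memory_utils.py; not the actual implementation shipped in src/.
import contextlib

import torch


@contextlib.contextmanager
def track_memory(label):
    """Report peak CUDA memory allocated inside the with-block."""
    torch.cuda.reset_peak_memory_stats()
    try:
        yield
    finally:
        peak_gib = torch.cuda.max_memory_allocated() / 2**30
        print(f"{label}: peak CUDA memory {peak_gib:.2f} GiB")


# Usage inside a training loop:
# with track_memory("train step"):
#     loss = model(batch).loss
#     loss.backward()
```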
## Contributing

Please read [CONTRIBUTING.md](CONTRIBUTING.md) for details on our code of conduct, and the process for submitting pull requests to us.