Use llama2-wrapper as your local llama2 backend for Generative Agents/Apps. Llama2.c is a tool to Train the Llama 2 LLM architecture in PyTorch then inference it with one simple 700-line C file (run.c ...