An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
-
Updated
Jun 2, 2024 - Jupyter Notebook
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
SimPO: Simple Preference Optimization with a Reference-Free Reward
Ingest files for retrieval augmented generation (RAG) with open-source Large Language Models (LLMs), all without 3rd parties or sensitive data leaving your network.
Evaluating LLMs' Cognitive Behavioral Reasoning for Cybersecurity
This package brings language model capabilities into the coding environment, providing a variety of functionalities.
The framework for fast development and deployment of RAG systems.
Possibly futile attempt at grounding hype with theory and fundamentals
Explain a black-box module in natural language.
Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)
Seamlessly integrate state-of-the-art transformer models into robotics stacks
⛓️ Langflow is a visual framework for building multi-agent and RAG applications. It's open-source, Python-powered, fully customizable, LLM and vector store agnostic.
Labs for Finetuning Large Language Models by DeepLearning.AI
The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.
The official code for CoT / ZSL reasoning framework 🧠, utilized in paper: "Large Language Models in Targeted Sentiment Analysis in Russian"
A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating Systems (LLM-OS).
🪢 Open source LLM engineering platform: Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
🪢 Langfuse documentation -- Langfuse is the open source LLM Engineering Platform. Observability, evals, prompt management, playground and metrics to debug and improve LLM apps
The official evaluation suite and dynamic data release for MixEval.
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Add a description, image, and links to the large-language-models topic page so that developers can more easily learn about it.
To associate your repository with the large-language-models topic, visit your repo's landing page and select "manage topics."