Research notes – Maxime Labonne

Training Data Influence Analysis and Estimation A Survey

Machine Learning

📝 Paper: https://arxiv.org/pdf/2212.04612.pdf

Self-Rewarding Language Models

Large Language Models

This paper introduces Iterative DPO and use it to improve the performance of Llama 2 70B on the AlpacaEval…

Local Large Language Models – Int8

Large Language Models

Quantization

📝 Article: https://int8.io/local-large-language-models-beginners-guide/

LoraHub – Efficient Cross-Task Generalization via Dynamic LoRA Composition

Large Language Models

This paper describes a framework to combine LoRA modules to achieve adaptable performance on unseen tasks.…

Multipack Sampler

Large Language Models

💻 GitHub: https://github.com/imoneoi/multipack_sampler

InCoder – A Generative Model for Code Infilling and Synthesis

Large Language Models

InCoder is a 1.3B parameter LLM that can generate code (left-to-right) as well as editing (via masking and…

Orca – Progressive Learning from Complex Explanation Traces of GPT-4

Large Language Models

Orca is a 13B parameter LLM with ChatGPT level of performance thanks to a huge dataset of 5M samples with…

phi-1 – Textbooks Are All You Need

Large Language Models

It introduces phi-1, a model with 1.3B parameters that obtains a pass@1 rate of 50.6% on HumanEval thanks to…

LongNet – Scaling Transformers to 1,000,000,000 Tokens

Large Language Models

This paper introduces the dilated attention mechanism, another sparse attention scheme which approximates…

Tart – A plug-and-play Transformer module for task-agnostic reasoning

Large Language Models

Tart combines the performance of fine-tuning with the ease-of-use of in-context learning. It is a general…

Inference Optimization – Lil’Log

Large Language Models

Quantization

📝 Article: https://lilianweng.github.io/posts/2023-01-10-inference-optimization/

Extending the Context Window of LLMs

Large Language Models

📝 Article: https://kaiokendev.github.io/context

GPTQ – Accurate Post-Training Quantization for Generative Pre-trained Transformers

Large Language Models

This paper introduces GPTQ, the first reliable quantization technique to peform 4- and even 3-bit…

LIMA – Less Is More for Alignment

Large Language Models

LIMA is a 65B parameter LLaMA model fine-tuned (supervised learning) using 1,000 samples, which outperforms…

LoRA – Low-Rank Adaptation of Large Language Models

Large Language Models

PEFT approach based on matrix factorization.

Report – Few-Shot Text Classification

Large Language Models

📝 Article: https://few-shot-text-classification.fastforwardlabs.com/ (2020)