Fine Tuning - a imjliao Collection

imjliao 's Collections

Agent

Prompt

Entity

Information Retrieval

QA

Document Information Extraction

MLLM

AIF

Models

Fine Tuning

updated Oct 30, 2023

Tuna: Instruction Tuning using Feedback from Large Language Models

Paper • 2310.13385 • Published Oct 20, 2023 • 10
Contrastive Prefence Learning: Learning from Human Feedback without RL

Paper • 2310.13639 • Published Oct 20, 2023 • 25
Teaching Language Models to Self-Improve through Interactive Demonstrations

Paper • 2310.13522 • Published Oct 20, 2023 • 12
Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 122
Table-GPT: Table-tuned GPT for Diverse Table Tasks

Paper • 2310.09263 • Published Oct 13, 2023 • 41