Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
imjliao 's Collections
Agent
Summarization
Reasoning
Prompt
Synthetic Data
Dialogue
Entity
Information Retrieval
QA
Document Information Extraction
Long Context
Document AI
Tool Use
Fine Tuning
MLLM
AIF
Models

Fine Tuning

updated Oct 30, 2023
Upvote
-

  • Tuna: Instruction Tuning using Feedback from Large Language Models

    Paper • 2310.13385 • Published Oct 20, 2023 • 10

  • Contrastive Prefence Learning: Learning from Human Feedback without RL

    Paper • 2310.13639 • Published Oct 20, 2023 • 25

  • Teaching Language Models to Self-Improve through Interactive Demonstrations

    Paper • 2310.13522 • Published Oct 20, 2023 • 12

  • Zephyr: Direct Distillation of LM Alignment

    Paper • 2310.16944 • Published Oct 25, 2023 • 122

  • Table-GPT: Table-tuned GPT for Diverse Table Tasks

    Paper • 2310.09263 • Published Oct 13, 2023 • 41
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs