Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Matthew Hollings's picture
2 2 57

Matthew Hollings

matthh
·
https://applyingai.dev/
  • mattholl

AI & ML interests

Generative AI, computational creativity, reinforcement learning

Organizations

None yet

matthh 's collections 2

RL
  • Direct Preference Optimization: Your Language Model is Secretly a Reward Model

    Paper • 2305.18290 • Published May 29, 2023 • 66
  • PERL: Parameter Efficient Reinforcement Learning from Human Feedback

    Paper • 2403.10704 • Published Mar 15, 2024 • 60
To Read
  • Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers Faster

    Paper • 2311.08263 • Published Nov 14, 2023 • 16
RL
  • Direct Preference Optimization: Your Language Model is Secretly a Reward Model

    Paper • 2305.18290 • Published May 29, 2023 • 66
  • PERL: Parameter Efficient Reinforcement Learning from Human Feedback

    Paper • 2403.10704 • Published Mar 15, 2024 • 60
To Read
  • Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers Faster

    Paper • 2311.08263 • Published Nov 14, 2023 • 16
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs