1 11 22

Archit Singhal

singhalarchit

AI & ML interests

None yet

Recent Activity

liked a model 10 days ago

SulphurAI/Sulphur-2-base

authored a paper 8 months ago

From Frames to Clips: Efficient Key Clip Selection for Long-Form Video Understanding

upvoted a paper 8 months ago

From Frames to Clips: Efficient Key Clip Selection for Long-Form Video Understanding

View all activity

Organizations

None yet

upvoted a paper 8 months ago

From Frames to Clips: Efficient Key Clip Selection for Long-Form Video Understanding

Paper • 2510.02262 • Published Oct 2, 2025 • 3

upvoted 2 articles about 1 year ago

Article

How to generate text: using different decoding methods for language generation with Transformers

patrickvonplaten

•

Mar 1, 2020

• 298

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

natolambert, LouisCastricato, lvwerra, Dahoas

•

Dec 9, 2022

• 414

upvoted a paper over 1 year ago

Ring Attention with Blockwise Transformers for Near-Infinite Context

Paper • 2310.01889 • Published Oct 3, 2023 • 13

upvoted 7 articles over 1 year ago

Article

Vision Language Models Explained

merve, edbeeching

•

Apr 11, 2024

• 531

Article

Breaking resolution curse of vision-language models

visheratin

•

Feb 24, 2024

• 22

Article

A Dive into Vision-Language Models

adirik, sayakpaul

•

Feb 3, 2023

• 84

Article

Design choices for Vision Language Models in 2024

gigant

•

Apr 16, 2024

• 34

Article

Document Similarity Search with ColPali

fsommers

•

Sep 21, 2024

• 52

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

merve, andsteing, pcuenq

•

May 14, 2024

• 287

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

manu

•

Jul 5, 2024

• 317

Archit Singhal

AI & ML interests

Recent Activity

Organizations

singhalarchit's activity

How to generate text: using different decoding methods for language generation with Transformers

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Vision Language Models Explained

Breaking resolution curse of vision-language models

A Dive into Vision-Language Models

Design choices for Vision Language Models in 2024

Document Similarity Search with ColPali

PaliGemma – Google's Cutting-Edge Open Vision Language Model

ColPali: Efficient Document Retrieval with Vision Language Models 👀