OSU Natural Language Processing · GitHub
@OSU-NLP-Group

Popular repositories

  1. HippoRAG Public

    [NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personali…

    Python 2.9k 279

  2. Mind2Web Public

    [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist web agents

    Jupyter Notebook 884 117

  3. SeeAct Public

    [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

    Python 790 103

  4. GUI-Agents-Paper-List Public

    Building a comprehensive and handy list of papers for GUI agents

    Python 531 29

  5. TravelPlanner Public

    [ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"

    Python 431 63

  6. MagicBrush Public

    [NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".

    Python 379 14

Repositories

Showing 10 of 60 repositories
  • saev Public

    Sparse autoencoders for vision

    Python 47 MIT 6 5 3 Updated Oct 24, 2025
  • Mind2Web-2 Public

    [NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge

    Python 86 MIT 6 0 0 Updated Oct 23, 2025
  • GUI-Agents-Paper-List Public

    Building a comprehensive and handy list of papers for GUI agents

    Python 531 29 1 0 Updated Oct 21, 2025
  • GUI-Drag Public
    Python 0 0 0 0 Updated Oct 19, 2025
  • Explorer Public

    [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents

    Python 19 MIT 0 1 0 Updated Oct 14, 2025
  • LLM-IOAA Public

    Code and data for the paper "Large Language Models Achieve Gold Medal Performance at the International Olympiad on Astronomy & Astrophysics (IOAA)" (https://arxiv.org/abs/2510.05016).

    TeX 13 MIT 1 0 0 Updated Oct 7, 2025
  • TravelPlanner Public

    [ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"

    Python 431 MIT 63 0 0 Updated Oct 5, 2025
  • WebDreamer Public

    [TMLR'25] "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"

    Python 88 5 4 0 Updated Oct 5, 2025
  • HippoRAG Public

    [NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personalized PageRank.

    Python 2,880 MIT 279 16 3 Updated Sep 4, 2025
  • ScienceAgentBench Public

    [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

    Python 106 MIT 15 4 0 Updated Aug 26, 2025