Skip to content
View shamangary's full-sized avatar

Block or report shamangary

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 724 45 Updated Jan 26, 2025

official code of partial connection adpation

Python 1 Updated Jan 24, 2025

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 4,041 291 Updated Oct 5, 2024

[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"

Python 299 42 Updated Dec 11, 2024

Grandmaster-Level Chess Without Search

Python 548 30 Updated Jan 10, 2025
Python 1,949 125 Updated Jan 16, 2025

Fast and memory-efficient exact attention

Python 15,184 1,435 Updated Jan 18, 2025

(MAF-YOLOv2) with high parameter utilization and high precision

Python 26 1 Updated Jan 24, 2025

Welcome to the Table Meets LLM repository!

Python 27 3 Updated Jan 21, 2025

LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.

Python 300 13 Updated Jan 13, 2025
Python 701 74 Updated Jun 20, 2023

Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Python 287 17 Updated Jan 17, 2025

Get your documents ready for gen AI

Python 19,133 1,011 Updated Jan 26, 2025
Python 309 24 Updated Dec 31, 2024

An AI Hedge Fund Team

Python 7,122 1,375 Updated Jan 25, 2025

Open source Claude Artifacts – built with Llama 3.1 405B

TypeScript 5,328 1,130 Updated Jan 22, 2025

Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.

Rust 53,479 3,414 Updated Jan 26, 2025
TypeScript 9,507 522 Updated Jan 26, 2025

A Comprehensive Benchmark for Document Parsing and Evaluation

Python 210 20 Updated Jan 17, 2025

Long Context Transfer from Language to Vision

Python 357 19 Updated Nov 20, 2024

【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling

109 4 Updated Oct 18, 2024

A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.

Python 160 11 Updated May 23, 2024

YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis

Python 83 16 Updated Jan 6, 2025

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,166 587 Updated Apr 16, 2024
Python 3,317 297 Updated Oct 16, 2024

Everything about the SmolLM2 and SmolVLM family of models

Python 1,619 85 Updated Jan 24, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 5,585 442 Updated Jan 5, 2025
Next