Blog-explorers

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

Ujjwal-Tyagi new activity about 13 hours ago

blog-explorers/README:Pending Access Request to Join HF Blog Explorers

Reality123b new activity 3 days ago

blog-explorers/README:Pending Blog-Explorers Access Request

adamm-hf new activity 12 days ago

blog-explorers/README:The Next Evolution of AI: From Passive Models to Autonomous Systems

View all activity

Ujjwal-Tyagi

in blog-explorers/README about 13 hours ago

Pending Access Request to Join HF Blog Explorers

#17 opened about 17 hours ago by

AINovice2005

chengkunli

authored a paper 1 day ago

Towards Embodied AI with MuscleMimic: Unlocking full-body musculoskeletal motor learning at scale

Paper • 2603.25544 • Published 23 days ago

wangbuer999

posted an update 1 day ago

Post

2024

Hands-on testing of HY-World 2.0 shows a significant improvement in end-to-end engineering maturity compared to version 1.5

The model supports direct multimodal input from text, single-frame images, and video. Inference can be launched without camera intrinsic/extrinsic calibration or additional preprocessing

After panorama generation, the built-in Spatial Agent automatically performs semantic navigation path planning. Combined with spatial consistency constraints from HY-WorldStereo, it ensures artifact-free multi-view generation and stable geometric alignment

Outputs include standard 3D asset formats such as Mesh, 3DGS, and point clouds, which can be directly imported into Unity/UE

It is suitable for engineering scenarios including game level prototyping, digital twins, and embodied simulation

kargaranamir

authored a paper 1 day ago

GlotOCR Bench: OCR Models Still Struggle Beyond a Handful of Unicode Scripts

Paper • 2604.12978 • Published 4 days ago • 5

kargaranamir

submitted a paper to Daily Papers 2 days ago

GlotOCR Bench: OCR Models Still Struggle Beyond a Handful of Unicode Scripts

Paper • 2604.12978 • Published 4 days ago • 5

Reality123b

in blog-explorers/README 3 days ago

Pending Blog-Explorers Access Request

#10 opened 6 months ago by

KarthikAvinash

namanvats

posted an update 5 days ago

Post

3474

Ran a small controlled study on a frozen 40-task slice of Harbor Terminal-Bench-Pro, using the same model (minimax/minimax-m2.5) with two agent harnesses: Goose and OpenHands-SDK.

Under the base setup, reducing the turn budget from 100 to 60 pushed the two harnesses in opposite directions:

* Goose: 0.450 → 0.525
* OpenHands-SDK: 0.575 → 0.500

A tweaked 60-turn setup brought OpenHands-SDK back to 0.575. At their best, both harnesses reached the same 0.575 pass rate.

What surprised me most was the token profile: in this setup, the reported token usage for OpenHands-SDK was dramatically higher than Goose while converging to the same best score.

Same model, same task slice, different harness behavior under a tighter interaction budget.

Dataset:
namanvats/harbor-goose-openhands-benchmark

Code/configs:
https://github.com/namanvats/harbor-agent-ablation