shugavaneshwar's picture

shugavaneshwar

Nickstrain

·

https://github.com/NickStrain

AI & ML interests

NLP:Computer Vision

Recent Activity

reacted to Kseniase's post with 🔥 about 5 hours ago

8 New Types of RAG RAG techniques continuously evolve to enhance LLM response accuracy by retrieving relevant external data during generation. To keep up with current AI trends, new RAG types incorporate deep step-by-step reasoning, tree search, citations, multimodality and other effective techniques. Here's a list of 8 latest RAG advancements: 1. DeepRAG -> https://huggingface.co/papers/2502.01142 Models retrieval-augmented reasoning as a Markov Decision Process, enabling strategic retrieval. It dynamically decides when to retrieve external knowledge and when rely on parametric reasoning. 2. RealRAG -> https://huggingface.co/papers/2502.00848 Enhances novel object generation by retrieving real-world images and using self-reflective contrastive learning to fill knowledge gap, improve realism and reduce distortions. 3. Chain-of-Retrieval Augmented Generation (CoRAG) -> https://huggingface.co/papers/2501.14342 Retrieves information step-by-step and adjusts it, also deciding how much compute power to use at test time. If needed it reformulates queries. 4. VideoRAG -> https://huggingface.co/papers/2501.05874 Enables unlimited-length video processing, using dual-channel architecture that integrates graph-based textual grounding and multi-modal context encoding. 5. CFT-RAG -> https://huggingface.co/papers/2501.15098 A tree-RAG acceleration method uses an improved Cuckoo Filter to optimize entity localization, enabling faster retrieval. 6. Contextualized Graph RAG (CG-RAG) -> https://huggingface.co/papers/2501.15067 Uses Lexical-Semantic Graph Retrieval (LeSeGR) to integrate sparse and dense signals within graph structure and capture citation relationships 7. GFM-RAG -> https://huggingface.co/papers/2502.01113 A graph foundation model that uses a graph neural network to refine query-knowledge connections 8. URAG -> https://huggingface.co/papers/2501.16276 A hybrid system combining rule-based and RAG methods to improve lightweight LLMs for educational chatbots

upvoted a paper 12 days ago

UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

upvoted a paper 12 days ago

TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate

View all activity

Organizations

None yet

upvoted 2 papers 12 days ago

UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

Paper • 2603.03241 • Published Mar 3 • 87

TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate

Paper • 2504.19874 • Published Apr 28, 2025 • 32

upvoted 3 papers about 1 month ago

Efficient Memory Management for Large Language Model Serving with PagedAttention

Paper • 2309.06180 • Published Sep 12, 2023 • 51

Beyond RAG for Agent Memory: Retrieval by Decoupling and Aggregation

Paper • 2602.02007 • Published Feb 2 • 18

LMCache: An Efficient KV Cache Layer for Enterprise-Scale LLM Inference

Paper • 2510.09665 • Published Oct 8, 2025 • 5

upvoted 2 papers 2 months ago

Green-VLA: Staged Vision-Language-Action Model for Generalist Robots

Paper • 2602.00919 • Published Jan 31 • 324

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published Feb 2 • 263

upvoted a paper 5 months ago

Continuous Autoregressive Language Models

Paper • 2510.27688 • Published Oct 31, 2025 • 74

upvoted a paper 6 months ago

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5, 2025 • 140