Spaces: smolagents/ml-intern

akseljoonas HF Staff committed on Jan 2

Commit

a2e2d22

1 Parent(s): 9934918

improved search agent prompt and descriptions

Files changed (2)

agent/prompts/search_docs_system_prompt.yaml +11 -13
agent/tools/search_docs_tool.py +69 -36

agent/prompts/search_docs_system_prompt.yaml CHANGED

@@ -1,22 +1,21 @@
 search_docs_system_prompt: |
-  You are a specialized documentation search agent. Your task is to comprehensively search and synthesize information from Hugging Face documentation.
   # Search Strategy
   You must search thoroughly before synthesizing results. Follow this approach:
-  1. **Query Analysis**: Identify the core concepts and intent of the query
   2. **Initial Search**: Start with a broad search capturing the main topic
-  3. **Iterative Refinement**: Run multiple searches to go deeper into topics. You will see parsed HTML pages, also look into links on the html pages for best information - first-pass results often miss key details
-  4. **You must get to the end truth**: You must get to the bottom of the truth for this search query. You CAN NOT say that somebody should look up documentation. You must look it up yourself and give the best answer you can.
-  ## Query Formulation Best Practices
-  - Add relevant synonyms and related technical terms
-  - Remove filler words, focus on searchable concepts
-  - Break complex questions into focused sub-queries
-  - Include domain-specific terminology when applicable
-  - Try both specific terms and general related terms
   # Response Guidelines
@@ -25,14 +24,13 @@ search_docs_system_prompt: |
   1. **Analyze Relevance**: Evaluate which results directly answer the query
   2. **Synthesize**: Combine information from multiple sources when applicable
   3. **Prioritize**: Present information in order of relevance
-  4. **Cite Sources**: Reference which documents you're drawing from especially include relevant code samples and links to the code samples.
   5. **Acknowledge Gaps**: If documents don't fully answer the query, explicitly state this
   6. **Handle Conflicts**: If sources contradict, note this and explain your reasoning
-  7. **Be Concise**: Provide a clear, direct answer without unnecessary elaboration
   # Constraints
   - Only provide information found in the documentation
   - Do not make assumptions beyond what the sources state
   - If information is not found, say so clearly rather than guessing
-  - Focus on answering the query directly

 search_docs_system_prompt: |
+  You are a specialized documentation search agent. Your task is to comprehensively search and synthesize information from Hugging Face documentation. You are queried by a main agent that has to build a solution for a user. You have to give the best and most comprehensive guidance on how to solve the user's task.
   # Search Strategy
   You must search thoroughly before synthesizing results. Follow this approach:
+  1. **Query Analysis**: Identify the core concepts and intent of the original user query and the search query passed by the LLM.
   2. **Initial Search**: Start with a broad search capturing the main topic
+  3. **Iterative Refinement**: Run multiple searches to go deeper into topics. If you see links to other pages, also look into those pages for the best information; first-pass results often miss key details.
+  4. **You must get to the end truth**: You must get to the bottom of the truth for this search query. You CAN NOT say that somebody should look up documentation. You must look it up yourself and give the best answer you can, including code snippets and relevant information. You are teaching the main agent how to solve the user's task and have to give ALL relevant information on how to do it.
+  # Quality metrics:
+  - You are optimizing for the minimum viable way to solve the user request, reusing as much as possible of the code already found during your research. Opt for reliability and reusability. Hugging Face documents many best practices, and you must pass these on to the main agent.
+  # Useful links:
+  - code examples for trl (covers most LLM training tasks): https://github.com/huggingface/trl/tree/main/examples/scripts and https://github.com/huggingface/trl/tree/main/trl/scripts
   # Response Guidelines
   1. **Analyze Relevance**: Evaluate which results directly answer the query
   2. **Synthesize**: Combine information from multiple sources when applicable
   3. **Prioritize**: Present information in order of relevance
+  4. **Cite Sources**: Find and pass the relevant code and other snippets from the analyzed articles for the main agent to read.
   5. **Acknowledge Gaps**: If documents don't fully answer the query, explicitly state this
   6. **Handle Conflicts**: If sources contradict, note this and explain your reasoning
   # Constraints
   - Only provide information found in the documentation
   - Do not make assumptions beyond what the sources state
   - If information is not found, say so clearly rather than guessing
+  - Focus on giving best practices and comprehensive guidance on how to solve the user's task. Include all relevant code snippets from the docs without edits, along with the simplest ways to solve the user's task.

agent/tools/search_docs_tool.py CHANGED

@@ -96,10 +96,14 @@ async def search_docs_handler(arguments: dict[str, Any]) -> tuple[str, bool]:
         Tuple of (search_results, success)
     """
     query = arguments.get("query", "")
     if not query:
         return "Error: No search query provided", False
     try:
         # Import at runtime to avoid circular dependency
         from pathlib import Path
@@ -149,9 +153,12 @@ async def search_docs_handler(arguments: dict[str, Any]) -> tuple[str, bool]:
                 ),
             )
             # Run the sub-agent
             result = await Handlers.run_agent(
-                session=sub_session, text=query, max_iterations=30
             )
         # Return the final result or compiled events
@@ -163,41 +170,6 @@ async def search_docs_handler(arguments: dict[str, Any]) -> tuple[str, bool]:
         return f"Error in search_docs tool: {str(e)}", False
-# Tool specification to be used by the main agent
-SEARCH_DOCS_TOOL_SPEC = {
-    "name": "search_docs",
-    "description": (
-        "Intelligently search HF documentation for libraries, repositories, and best practices with an agent that has access to: explore_hf_docs, fetch_hf_docs, search_hf_api_endpoints. "
-        "The agent acts like your personal search assistant. "
-        "Using the search agent is necessary to give the best quality answer to the user's question. Most questions require a search to get the best information on code examples.\n\n"
-        "WHEN TO USE THIS TOOL:\n"
-        "  - When searching for high-level concepts like 'how to do GRPO training on a model?' or 'best way to do inference on a trained model?'\n"
-        "  - When you need to get code examples for intricate ML code patterns like training loops, inference pipelines, data processing, etc.\n\n"
-        "USAGE GUIDELINES:\n"
-        "  1. Launch multiple agents concurrently for better performance.\n"
-        "  2. Be specific in your query - include exact terminology, expected file locations, or code patterns.\n"
-        "  3. Use the query as if you were talking to another engineer. Bad: logger impl Good: where is the logger implemented, we're trying to find out how to log to files.\n"
-        "  4. Make sure to formulate the query in such a way that the agent knows when it's done or has found the result."
-    ),
-    "parameters": {
-        "type": "object",
-        "properties": {
-            "query": {
-                "type": "string",
-                "description": (
-                    "The search query describing to the agent what it should do. Be "
-                    "specific and include technical terms, file types, or expected "
-                    "code patterns to help the agent find relevant code. Formulate "
-                    "the query in a way that makes it clear to the agent when it "
-                    "has found the right thing."
-                ),
-            },
-        },
-        "required": ["query"],
-    },
-}
 async def make_search_agent_tools():
     """
     Create a list of tools for the search agent
@@ -237,3 +209,64 @@ async def make_search_agent_tools():
             handler=search_openapi_handler,
         ),
     ]

         Tuple of (search_results, success)
     """
     query = arguments.get("query", "")
+    user_query = arguments.get("user_query", "")
     if not query:
         return "Error: No search query provided", False
+    if not user_query:
+        return "Error: No user query provided", False
     try:
         # Import at runtime to avoid circular dependency
         from pathlib import Path
                 ),
             )
+            # make search prompt
+            search_prompt = f"What the user tasked the main agent with: {user_query}\nWhat you have been asked to research by the main agent: {query}. Use both to find the best practices, code examples, and determine the recommended approach for solving the user's task."
             # Run the sub-agent
             result = await Handlers.run_agent(
+                session=sub_session, text=search_prompt, max_iterations=30
             )
         # Return the final result or compiled events
         return f"Error in search_docs tool: {str(e)}", False
 async def make_search_agent_tools():
     """
     Create a list of tools for the search agent
             handler=search_openapi_handler,
         ),
     ]
+# Tool specification to be used by the main agent
+SEARCH_DOCS_TOOL_SPEC = {
+    "name": "research_solution",
+    "description": (
+        "Spawns a specialized research sub-agent that searches for best practices, locates code examples, and determines the recommended approach for solving the user's task.\n\n"
+        "SEARCH AGENT CAPABILITIES:\n"
+        "The search sub-agent has access to these specialized tools:\n"
+        "  - explore_hf_docs: Discovers documentation structure by parsing sidebar navigation, returns page titles, URLs, and content glimpses\n"
+        "  - fetch_hf_docs: Retrieves full markdown content from specific HF documentation pages\n"
+        "  - search_hf_api_endpoints: Searches HF OpenAPI specification by tag to find API endpoints with usage examples\n"
+        "  - GitHub tools: search_code, search_repositories, get_file_contents, list_issues, list_pull_requests (for searching HF repositories)\n\n"
+        "MANDATORY FIRST STEP for:\n"
+        "  - ANY task involving training, fine-tuning, or model deployment with HF libraries\n"
+        "  - Implementing ML workflows (data loading, preprocessing, training loops, inference pipelines)\n"
+        "  - Working with specific HF libraries (transformers, diffusers, trl, datasets, accelerate, etc.)\n"
+        "  - Finding the recommended/official way to accomplish ML tasks\n"
+        "  - Understanding which libraries and methods to use for a user's goal\n\n"
+        "ALSO USE for:\n"
+        "  - Verifying current API signatures, parameters, or available methods\n"
+        "  - Finding code examples and best practices from official documentation\n"
+        "  - Understanding relationships between HF libraries and components\n\n"
+        "SKIP ONLY when:\n"
+        "  - User asks simple factual questions answerable from general ML knowledge (e.g., 'What is fine-tuning?')\n"
+        "  - Task is about general Python/programming unrelated to ML or HF libraries\n\n"
+        "QUERY FORMAT:\n"
+        "Write queries as if delegating to an engineer. Include:\n"
+        "  - Specific library names (e.g., 'trl', 'transformers', 'diffusers')\n"
+        "  - Technical terminology from the domain (e.g., 'DPO trainer', 'GRPO', 'LoRA adapter')\n"
+        "  - Clear success criteria (e.g., 'find code example', 'verify parameter exists', 'get recommended approach')\n\n"
+        "QUERY EXAMPLES:\n"
+        "  Good: 'Find the best way to implement DPO training in trl. Get code example showing dataset format, trainer configuration, and reward model setup'\n"
+        "  Bad: 'dpo trainer'\n"
+        "  Good: 'Search transformers docs for the recommended approach to load and run quantized models with 4-bit precision. Find the specific classes and methods to use'\n"
+        "  Bad: 'quantization'\n"
+        "  Good: 'Research the best way to fine-tune a diffusion model for custom image generation. Find which library to use (diffusers/PEFT), required components, and complete training example'\n"
+        "  Bad: 'fine-tune diffusion'\n\n"
+    ),
+    "parameters": {
+        "type": "object",
+        "properties": {
+            "user_query": {
+                "type": "string",
+                "description": (
+                    "The original user query that you received. This will be used to search the documentation."
+                ),
+            },
+            "query": {
+                "type": "string",
+                "description": (
+                    "Detailed search query for the specialized agent. Must include: (1) specific library/component names, "
+                    "(2) technical terms or concepts to search for, (3) clear objective (e.g., 'find code example', "
+                    "'verify API exists', 'get implementation details'). The search agent will autonomously explore "
+                    "documentation structure, retrieve relevant pages, and compile results until the objective is met."
+                ),
+            },
+        },
+        "required": ["user_query", "query"],
+    },
+}
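
The argument checks and prompt construction introduced in this commit can be tried in isolation. The following is a minimal sketch, not the repository's code: `PARAMETERS` is a trimmed, hypothetical copy of the spec's `parameters` block, and `missing_required` and `build_search_prompt` are illustrative helper names that do not exist in the commit. It assumes the handler keeps returning a `(message, success)` tuple as shown in the diff.

```python
from typing import Any

# Hypothetical, trimmed copy of the "parameters" block from the tool spec.
PARAMETERS = {
    "type": "object",
    "properties": {
        "user_query": {"type": "string"},
        "query": {"type": "string"},
    },
    "required": ["user_query", "query"],
}


def missing_required(arguments: dict[str, Any]) -> list[str]:
    """Return the required argument names that are absent or empty."""
    return [name for name in PARAMETERS["required"] if not arguments.get(name)]


def build_search_prompt(arguments: dict[str, Any]) -> tuple[str, bool]:
    """Apply the same checks as the handler, then combine both queries."""
    query = arguments.get("query", "")
    user_query = arguments.get("user_query", "")
    if not query:
        return "Error: No search query provided", False
    if not user_query:
        return "Error: No user query provided", False
    # The sub-agent receives both the user's original task and the
    # main agent's narrower research request in a single prompt.
    prompt = (
        f"What the user tasked the main agent with: {user_query}\n"
        f"What you have been asked to research by the main agent: {query}. "
        "Use both to find the best practices, code examples, and determine "
        "the recommended approach for solving the user's task."
    )
    return prompt, True


if __name__ == "__main__":
    text, ok = build_search_prompt(
        {
            "user_query": "Fine-tune a small model with DPO",
            "query": "Find a trl code example for DPO training",
        }
    )
    print(ok)
    print(text)
```

Because `user_query` is now required, a caller that still sends only `query` gets an error tuple instead of a sub-agent run, so any main-agent prompt or client built against the old `search_docs` schema would need updating alongside this change.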