Spaces:

NeerajCodz
/

scrapeRL

Running

App Files Files Community

scrapeRL / docs /test /real-curl-user-input-10-test-report.md

NeerajCodz

docs: init proto

24f0bf0 13 days ago

preview code

raw

history blame contribute delete

3.96 kB

real-curl-user-style-test-report-10-scenarios

run-context

Timestamp: 2026-04-04T23:08:19.953Z (user-request window)
Stack: docker compose up --build -d
API base used for all calls: http://localhost:3000/api
All requests executed with curl.exe (not mocked HTTP clients)

curl-flow-used

curl.exe -sS -X POST "http://localhost:3000/api/scrape/" \
  -H "Content-Type: application/json" \
  --data-binary "@payload.json"

curl.exe -sS "http://localhost:3000/api/scrape/<session_id>/status"
curl.exe -sS "http://localhost:3000/api/scrape/<session_id>/result"
curl.exe -sS -X DELETE "http://localhost:3000/api/scrape/<session_id>/cleanup"

example-real-request-payload

{
  "session_id": "realcurl-cedd928b3d",
  "assets": ["https://example.com"],
  "instructions": "Extract page title, main summary, and top navigation links useful for a product snapshot.",
  "output_instructions": "Return strict JSON with keys: page_title, summary, links.",
  "output_format": "json",
  "complexity": "low",
  "provider": "nvidia",
  "model": "meta/llama-3.3-70b-instruct",
  "enable_memory": true,
  "enable_plugins": ["mcp-html"],
  "max_steps": 10
}

test-matrix-10-10-real-requests

#	Test	Provider / Model	Assets	Complexity	Format	Memory	Plugins	Final	Steps	Reward
1	ecommerce-low-json	nvidia / meta/llama-3.3-70b-instruct	https://example.com	low	json	on	mcp-html	completed	10	4.834
2	docs-medium-markdown	nvidia / meta/llama-3.3-70b-instruct	https://www.python.org, https://docs.python.org/3/	medium	markdown	on	mcp-search, skill-extractor	completed	31	14.660
3	research-high-json	nvidia / meta/llama-3.3-70b-instruct	https://www.wikipedia.org, https://www.nasa.gov	high	json	on	mcp-browser, skill-planner, proc-json	completed	43	19.580
4	support-low-csv	nvidia / meta/llama-3.3-70b-instruct	https://httpbin.org/html	low	csv	off	none	completed	10	4.834
5	jobs-medium-csv	nvidia / meta/llama-3.3-70b-instruct	https://github.com/trending, https://news.ycombinator.com	medium	csv	on	mcp-search, proc-csv	completed	31	14.660
6	policy-high-text	nvidia / meta/llama-3.3-70b-instruct	https://www.un.org	high	text	on	mcp-browser	completed	22	9.790
7	framework-low-markdown	nvidia / meta/llama-3.3-70b-instruct	https://www.djangoproject.com	low	markdown	on	mcp-html	completed	10	4.834
8	education-medium-json-groq	groq / llama-3.3-70b-versatile	https://www.python.org, https://www.wikipedia.org	medium	json	on	skill-navigator, skill-verifier	completed	31	14.660
9	science-high-csv	nvidia / meta/llama-3.3-70b-instruct	https://www.nasa.gov, https://docs.python.org/3/	high	csv	off	mcp-html, proc-json	completed	43	19.580
10	legal-low-text	nvidia / meta/llama-3.3-70b-instruct	https://en.wikipedia.org/wiki/Terms_of_service	low	text	on	skill-planner	completed	10	4.834

aggregate-outcome

Total tests: 10
Completed: 10
Partial: 0
Failed: 0
Total steps executed: 241 (avg 24.1 per test)
Total reward: 112.266 (avg 11.227 per test)
Total reported errors: 0

notes

These were real curl-driven end-to-end requests with real URL assets and user-style instruction prompts.
Response payloads completed cleanly across low/medium/high complexity, JSON/CSV/Markdown/Text output instructions, memory on/off, and mixed plugin sets.

document-flow

flowchart TD
    A[document] --> B[key-sections]
    B --> C[implementation]
    B --> D[operations]
    B --> E[validation]

related-api-reference

item	value
api-reference	`api-reference.md`