researchpilot-api / output.txt
feat: data ingestion and processing pipeline complete
233102d
2026-03-13 02:33:43 | INFO  | __main__:main:105 | =======================================================
2026-03-13 02:33:43 | INFO  | __main__:main:106 | RESEARCHPILOT - INGESTION TEST SUITE
2026-03-13 02:33:43 | INFO  | __main__:main:107 | =======================================================
2026-03-13 02:33:43 | INFO  | __main__:main:110 | 
[TEST 1] Checking existing data on disk...
2026-03-13 02:33:43 | INFO  | __main__:test_existing_data:26 | Papers already on disk: 10
2026-03-13 02:33:43 | INFO  | __main__:test_existing_data:37 |  -> 2603.10995: Factorized Neural Implicit DMD for Parametric Dynamics...
2026-03-13 02:33:43 | INFO  | __main__:test_existing_data:38 |  Category: cs.LG | Date: 2026-03-11
2026-03-13 02:33:43 | INFO  | __main__:test_existing_data:37 |  -> 2603.11000: Cross-Species Transfer Learning for Electrophysiology-to-Tra...
2026-03-13 02:33:43 | INFO  | __main__:test_existing_data:38 |  Category: cs.LG | Date: 2026-03-11
2026-03-13 02:33:43 | INFO  | __main__:test_existing_data:37 |  -> 2603.11001: RCTs & Human Uplift Studies: Methodological Challenges and P...
2026-03-13 02:33:43 | INFO  | __main__:test_existing_data:38 |  Category: cs.CY | Date: 2026-03-11
2026-03-13 02:33:43 | INFO  | __main__:main:114 | 
[TEST 2] Schema validation...
2026-03-13 02:33:43 | INFO  | __main__:test_schema_validation:46 | Testing schema validation...
2026-03-13 02:33:43 | INFO  | __main__:test_schema_validation:67 |  -> Schema validation: PASSED
2026-03-13 02:33:43 | INFO  | __main__:test_schema_validation:68 |  paper_id cleaned: '2301.07041'
2026-03-13 02:33:43 | INFO  | __main__:test_schema_validation:69 |  title cleaned: 'Test Paper With Extra Spaces'
2026-03-13 02:33:43 | INFO  | __main__:main:118 | 
[TEST 3] Fresh fetch from ArXiv...
2026-03-13 02:33:43 | INFO  | __main__:test_fresh_fetch:81 | Fetching 3 fresh papers from ArXiv...
2026-03-13 02:33:43 | INFO  | src.ingestion.arxiv_fetcher:__init__:132 | ArXivFetcher initialized. Already have 3 papers indexed.
2026-03-13 02:33:43 | INFO  | src.ingestion.arxiv_fetcher:fetch_papers:262 | Search query: 'cat:cs.LG OR cat:cs.AI'
2026-03-13 02:33:43 | INFO  | src.ingestion.arxiv_fetcher:fetch_papers:263 | Target: '3 papers'
2026-03-13 02:33:43 | INFO  | src.ingestion.arxiv_fetcher:fetch_papers:278 | Starting ArXiv fetch...
2026-03-13 02:33:43 | INFO  | src.ingestion.arxiv_fetcher:fetch_papers:317 | Fetch complete. Fetched: 3 | Skipped (duplicate): 0 | Skipped (invalid): 0
2026-03-13 02:33:43 | INFO  | __main__:test_fresh_fetch:96 |  -> Fresh fetch: PASSED. Got 3 papers
2026-03-13 02:33:43 | INFO  | __main__:test_fresh_fetch:98 |  2603.11048: COMIC: Agentic Sketch Comedy Generation...
2026-03-13 02:33:43 | INFO  | __main__:test_fresh_fetch:98 |  2603.11047: LiTo: Surface Light Field Tokenization...
2026-03-13 02:33:43 | INFO  | __main__:test_fresh_fetch:98 |  2603.11045: Neural Field Thermal Tomography: A Differentiable Physi...
2026-03-13 02:33:43 | INFO  | __main__:main:121 | 
=======================================================
2026-03-13 02:33:43 | INFO  | __main__:main:122 | TEST SUITE COMPLETE
2026-03-13 02:33:43 | INFO  | __main__:main:123 | Existing papers: 3 shown (may have more)
2026-03-13 02:33:43 | INFO  | __main__:main:124 | Fresh papers fetched: 3
2026-03-13 02:33:43 | INFO  | __main__:main:125 | =======================================================