Karim shoair commited on
Commit ·
73a751a
1
Parent(s): a120517
docs: updating benchmarks page
Browse files- docs/benchmarks.md +14 -13
docs/benchmarks.md
CHANGED
|
@@ -1,21 +1,20 @@
|
|
| 1 |
# Performance Benchmarks
|
| 2 |
|
| 3 |
-
Scrapling isn't just powerful—it's also blazing fast
|
| 4 |
-
|
| 5 |
-
## Benchmark Results
|
| 6 |
|
| 7 |
### Text Extraction Speed Test (5000 nested elements)
|
| 8 |
|
| 9 |
| # | Library | Time (ms) | vs Scrapling |
|
| 10 |
|---|:-----------------:|:---------:|:------------:|
|
| 11 |
-
| 1 | Scrapling |
|
| 12 |
-
| 2 | Parsel/Scrapy | 2.
|
| 13 |
-
| 3 | Raw Lxml |
|
| 14 |
-
| 4 | PyQuery |
|
| 15 |
-
| 5 | Selectolax |
|
| 16 |
-
| 6 |
|
| 17 |
-
| 7 |
|
| 18 |
-
| 8 | BS4 with html5lib |
|
|
|
|
| 19 |
|
| 20 |
### Element Similarity & Text Search Performance
|
| 21 |
|
|
@@ -23,5 +22,7 @@ Scrapling's adaptive element finding capabilities significantly outperform alter
|
|
| 23 |
|
| 24 |
| Library | Time (ms) | vs Scrapling |
|
| 25 |
|-------------|:---------:|:------------:|
|
| 26 |
-
| Scrapling | 2.
|
| 27 |
-
| AutoScraper |
|
|
|
|
|
|
|
|
|
| 1 |
# Performance Benchmarks
|
| 2 |
|
| 3 |
+
Scrapling isn't just powerful—it's also blazing fast. The following benchmarks compare Scrapling's parser with the latest versions of other popular libraries.
|
|
|
|
|
|
|
| 4 |
|
| 5 |
### Text Extraction Speed Test (5000 nested elements)
|
| 6 |
|
| 7 |
| # | Library | Time (ms) | vs Scrapling |
|
| 8 |
|---|:-----------------:|:---------:|:------------:|
|
| 9 |
+
| 1 | Scrapling | 2.02 | 1.0x |
|
| 10 |
+
| 2 | Parsel/Scrapy | 2.04 | 1.01 |
|
| 11 |
+
| 3 | Raw Lxml | 2.54 | 1.257 |
|
| 12 |
+
| 4 | PyQuery | 24.17 | ~12x |
|
| 13 |
+
| 5 | Selectolax | 82.63 | ~41x |
|
| 14 |
+
| 6 | MechanicalSoup | 1549.71 | ~767.1x |
|
| 15 |
+
| 7 | BS4 with Lxml | 1584.31 | ~784.3x |
|
| 16 |
+
| 8 | BS4 with html5lib | 3391.91 | ~1679.1x |
|
| 17 |
+
|
| 18 |
|
| 19 |
### Element Similarity & Text Search Performance
|
| 20 |
|
|
|
|
| 22 |
|
| 23 |
| Library | Time (ms) | vs Scrapling |
|
| 24 |
|-------------|:---------:|:------------:|
|
| 25 |
+
| Scrapling | 2.39 | 1.0x |
|
| 26 |
+
| AutoScraper | 12.45 | 5.209x |
|
| 27 |
+
|
| 28 |
+
> All benchmarks represent averages of 100+ runs. See [benchmarks.py](https://github.com/D4Vinci/Scrapling/blob/main/benchmarks.py) for methodology.
|