Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,100 @@
|
|
| 1 |
-
---
|
| 2 |
-
|
| 3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
language: en
|
| 3 |
+
tags:
|
| 4 |
+
- tessar
|
| 5 |
+
- table-question-answering
|
| 6 |
+
- svector
|
| 7 |
+
- neural-sql-executor
|
| 8 |
+
datasets:
|
| 9 |
+
- Stanford/wikitablequestions
|
| 10 |
+
license: mit
|
| 11 |
+
pipeline_tag: table-question-answering
|
| 12 |
+
---
|
| 13 |
+
|
| 14 |
+
# Tessar (Large-Sized Model)
|
| 15 |
+
|
| 16 |
+
Tessar is an advanced table reasoning model developed by SVECTOR, building upon the groundbreaking research and pushing the boundaries of neural table understanding.
|
| 17 |
+
|
| 18 |
+
## Model Description
|
| 19 |
+
|
| 20 |
+
Tessar (**Te**xtual **S**QL **A**nalysis and **R**easoning) is a sophisticated neural model designed to excel in table-based question answering. Tessar implements an innovative neural SQL executor that can interpret and reason over complex tabular data with remarkable precision.
|
| 21 |
+
|
| 22 |
+
The model is constructed using the BART architecture, featuring a bidirectional encoder and an autoregressive decoder. This design allows Tessar to capture intricate contextual relationships within tabular data and generate accurate, contextually relevant answers.
|
| 23 |
+
|
| 24 |
+
### Key Features
|
| 25 |
+
- Advanced neural SQL execution capabilities
|
| 26 |
+
- State-of-the-art performance on complex table question answering
|
| 27 |
+
- Robust handling of nuanced and multi-step queries
|
| 28 |
+
- Fine-tuned on the WikiTableQuestions dataset
|
| 29 |
+
|
| 30 |
+
## Intended Uses
|
| 31 |
+
|
| 32 |
+
Tessar is particularly powerful for solving complex table-based questions across various domains. Here are some example questions the model can effectively address:
|
| 33 |
+
|
| 34 |
+
| Question | Example Answer |
|
| 35 |
+
|:---: |:---:|
|
| 36 |
+
| According to the table, what is the last title produced? | Specific Title |
|
| 37 |
+
| What is the difference in a specific comparative metric? | Numerical Difference |
|
| 38 |
+
| Which entity had the most significant impact in a given context? | Identified Entity |
|
| 39 |
+
| What were the first and last entries in a specific column? | Comparative Entries |
|
| 40 |
+
|
| 41 |
+
### How to Use
|
| 42 |
+
|
| 43 |
+
Here's a comprehensive example of using Tessar with the Transformers library:
|
| 44 |
+
|
| 45 |
+
```python
|
| 46 |
+
from transformers import TessarTokenizer, BartForConditionalGeneration
|
| 47 |
+
import pandas as pd
|
| 48 |
+
|
| 49 |
+
# Load Tessar model and tokenizer
|
| 50 |
+
tokenizer = TessarTokenizer.from_pretrained("SVECTOR-CORPORATION/Tessar-largest")
|
| 51 |
+
model = BartForConditionalGeneration.from_pretrained("SVECTOR-CORPORATION/Tessar-largest")
|
| 52 |
+
|
| 53 |
+
# Prepare sample table data
|
| 54 |
+
data = {
|
| 55 |
+
"year": [1896, 1900, 1904, 2004, 2008, 2012],
|
| 56 |
+
"city": ["athens", "paris", "st. louis", "athens", "beijing", "london"]
|
| 57 |
+
}
|
| 58 |
+
table = pd.DataFrame.from_dict(data)
|
| 59 |
+
|
| 60 |
+
# Ask a specific query
|
| 61 |
+
query = "In which year did beijing host the Olympic Games?"
|
| 62 |
+
encoding = tokenizer(table=table, query=query, return_tensors="pt")
|
| 63 |
+
|
| 64 |
+
# Generate answer
|
| 65 |
+
outputs = model.generate(**encoding)
|
| 66 |
+
|
| 67 |
+
# Decode and print result
|
| 68 |
+
print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
|
| 69 |
+
# Expected output: [' 2008.0']
|
| 70 |
+
```
|
| 71 |
+
|
| 72 |
+
### Evaluation
|
| 73 |
+
|
| 74 |
+
For comprehensive evaluation scripts and benchmarking, please refer to the SVECTOR documentation and research repositories.
|
| 75 |
+
|
| 76 |
+
### Performance Highlights
|
| 77 |
+
- Exceptional accuracy on complex table reasoning tasks
|
| 78 |
+
- Robust handling of multi-step and contextual queries
|
| 79 |
+
- State-of-the-art performance on WikiTableQuestions dataset
|
| 80 |
+
|
| 81 |
+
### Citation
|
| 82 |
+
|
| 83 |
+
If you use Tessar in your research the SVECTOR implementation:
|
| 84 |
+
|
| 85 |
+
```bibtex
|
| 86 |
+
@inproceedings{svector2025tessar,
|
| 87 |
+
title={{Tessar}: Advanced Neural Table Reasoning},
|
| 88 |
+
author={{SVECTOR Team}},
|
| 89 |
+
year={2025},
|
| 90 |
+
institution={SVECTOR Research}
|
| 91 |
+
}
|
| 92 |
+
```
|
| 93 |
+
|
| 94 |
+
### Contact and Support
|
| 95 |
+
|
| 96 |
+
For further information, support, or collaboration opportunities, please contact SVECTOR's research team at research@svector.co.in.
|
| 97 |
+
|
| 98 |
+
### License
|
| 99 |
+
|
| 100 |
+
This model is released under the MIT License. Please review the licensing terms before use.
|