Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
|
@@ -25,19 +25,6 @@ We concentrate on understanding the relationship between internal structure in n
|
|
| 25 |
|
| 26 |
3. **Geometry of Program Synthesis (GPS)**: Applying SLT to study inductive biases, advancing our understanding of how to predict and measure alignment-relevant risks.
|
| 27 |
|
| 28 |
-
## Notable Achievements
|
| 29 |
-
|
| 30 |
-
- Established developmental interpretability as a concrete application of SLT to alignment
|
| 31 |
-
- Developed scalable new measuring tools like the Local Learning Coefficient (LLC)
|
| 32 |
-
- Validated that SLT can make accurate predictions about real-world AI systems
|
| 33 |
-
- Popularized SLT within the AI safety community through conferences, workshops, and collaborations
|
| 34 |
-
|
| 35 |
-
## Key Publications
|
| 36 |
-
|
| 37 |
-
- Quantifying Degeneracy in Singular Models via the learning coefficient (Lau et al. 2023)
|
| 38 |
-
- Estimating the Local Learning Coefficient at Scale (Furman and Lau 2024)
|
| 39 |
-
- The Developmental Landscape of In-Context Learning (Hoogland et al. 2024)
|
| 40 |
-
|
| 41 |
## Resources
|
| 42 |
|
| 43 |
- [DevInterp GitHub Repository](https://github.com/timaeus-research/devinterp)
|
|
|
|
| 25 |
|
| 26 |
3. **Geometry of Program Synthesis (GPS)**: Applying SLT to study inductive biases, advancing our understanding of how to predict and measure alignment-relevant risks.
|
| 27 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 28 |
## Resources
|
| 29 |
|
| 30 |
- [DevInterp GitHub Repository](https://github.com/timaeus-research/devinterp)
|