Rethinking Sequence Modeling: LLM Scaling Laws, Expressivity-Efficiency Tradeoffs, and the Role of Architecture
PhD Seminar, Georgia Tech Machine Learning Student Seminar, C1115 Druid Hills, CODA building, Atlanta, GA
ML PhD seminar talk at Georgia Tech on scaling laws, expressivity-efficiency tradeoffs, and the role of architecture in sequence modeling.
