M.H. Stewart Fellowship
Fellowship awarded by Georgia Tech.
Fellowship for research excellence awarded by Georgia Tech.
Best Poster Award (1st Place) at AASF AIX Summit 2026 for the Free Energy Mixer poster.
Published in ICLR 2024 (Poster), 2024
Presents ARM with AUEL, Random Dropping, and multi‑kernel local smoothing to better capture series‑wise patterns and inter‑series dependencies for long‑term multivariate time series forecasting (TSF).
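As a toy illustration of one named component, multi‑kernel local smoothing, the sketch below (my own illustration, not the ARM implementation; the function name and kernel widths are hypothetical) blends moving averages of several widths over a lookback window.

```python
# Toy sketch of multi-kernel local smoothing (illustrative only):
# smooth the lookback window with moving-average kernels of several
# widths and mix the results, so both sharp and slow-moving local
# patterns are retained.
import numpy as np

def multi_kernel_smooth(x, widths=(3, 7, 15), mix=None):
    """x: (L,) series window. Returns a blend of moving averages."""
    mix = np.ones(len(widths)) / len(widths) if mix is None else mix
    out = np.zeros_like(x, dtype=float)
    for w, m in zip(widths, mix):
        kernel = np.ones(w) / w
        # mode="same" keeps the original length (np.convolve zero-pads the edges)
        out += m * np.convolve(x, kernel, mode="same")
    return out

x = np.sin(np.arange(96) / 6.0) + 0.3 * np.random.default_rng(0).normal(size=96)
print(multi_kernel_smooth(x).shape)  # (96,)
```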
Published in ICML 2024 (Poster), PMLR 235: 32990–33006, 2024
Constructs Auxiliary Time Series (ATS) as exogenous inputs to capture inter‑series relations; identifies continuity, sparsity, and variability principles; improves multivariate TSF even with simple predictors.
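The summary notes that even simple predictors improve once auxiliary series enter as exogenous inputs. The sketch below is my own minimal illustration of that setup, not the paper's ATS construction: a plain least-squares forecaster whose input is the target lookback concatenated with auxiliary lookbacks (all names, shapes, and the toy data are hypothetical).

```python
# Minimal sketch: a linear forecaster consuming [target lookback | auxiliary
# lookbacks] as exogenous features (illustrative only, not the ATS method).
import numpy as np

rng = np.random.default_rng(0)
L, H, A = 96, 24, 4          # lookback length, horizon, number of auxiliary series

def fit_linear_forecaster(X, Y):
    # X: (n_samples, L*(1+A)) flattened inputs; Y: (n_samples, H) future targets.
    # Ordinary least squares via lstsq.
    W, *_ = np.linalg.lstsq(X, Y, rcond=None)
    return W

# toy data: the target depends weakly on the auxiliary channels
n = 512
aux = rng.normal(size=(n, A, L))
target = aux.mean(axis=1) + 0.1 * rng.normal(size=(n, L))
future = target[:, -1:] + np.cumsum(0.05 * rng.normal(size=(n, H)), axis=1)

X = np.concatenate([target, aux.reshape(n, A * L)], axis=1)
W = fit_linear_forecaster(X, future)
pred = X @ W
print("train MSE:", np.mean((pred - future) ** 2))
```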
Published in ICLR 2025 (Poster), 2025
Reformulates TSF as in‑context learning by constructing tokens of (lookback, future) task pairs, enabling Transformers to adapt predictors from context without parameter updates.
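A minimal sketch of the token construction as I read the summary (not the paper's code; the function name, window sizes, and zero-padding of the query's future slot are my assumptions): each context token packs a (lookback, future) pair from earlier history, and a final query token carries only the current lookback.

```python
# Build in-context tokens of (lookback, future) task pairs plus a query
# token whose future part is left empty (illustrative sketch only).
import numpy as np

def build_in_context_tokens(series, L=32, H=8, n_context=4):
    """series: 1-D array. Returns (tokens, query); each token is the
    concatenation [lookback, future], the query is [lookback, zeros]."""
    tokens = []
    step = L + H
    for i in range(n_context):
        start = i * step
        lookback = series[start : start + L]
        future = series[start + L : start + L + H]
        tokens.append(np.concatenate([lookback, future]))
    q_start = n_context * step
    query = np.concatenate([series[q_start : q_start + L], np.zeros(H)])
    return np.stack(tokens), query

series = np.sin(np.arange(300) / 10.0)
context, query = build_in_context_tokens(series)
print(context.shape, query.shape)   # (4, 40) (40,)
```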
Published in ICML 2025 (Poster), PMLR 267: 40464–40490, 2025
Adds ARMA structure to autoregressive attention via a weighted varying gate, decoupling long‑range and local effects and improving TSF quality without increasing asymptotic complexity.
Published in ICML 2025 (Poster), PMLR 267: 40848–40867, 2025
Shows that a linear attention layer can be interpreted as a dynamic VAR; proposes SAMoVAR to realign multi‑layer Transformers with autoregressive forecasting for improved interpretability and accuracy.
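The algebraic core of that reading can be checked numerically: with causal, unnormalized linear attention, the output at time t is a weighted sum of past value vectors with time-varying coefficients a_{t,s} = q_t · k_s, i.e. a dynamic VAR over the values. The toy check below verifies only this identity; SAMoVAR itself is not implemented here.

```python
# Causal linear attention vs. an explicit dynamic-VAR style expansion.
import numpy as np

rng = np.random.default_rng(0)
T, d = 6, 4
Q, K, V = (rng.normal(size=(T, d)) for _ in range(3))

# causal linear attention: y_t = sum_{s<=t} (q_t . k_s) v_s
S = np.zeros((d, d))                 # running sum of k_s v_s^T
y_linear = np.zeros((T, d))
for t in range(T):
    S += np.outer(K[t], V[t])
    y_linear[t] = Q[t] @ S

# the same output written as a VAR with time-varying coefficients
y_var = np.zeros((T, d))
for t in range(T):
    for s in range(t + 1):
        a_ts = Q[t] @ K[s]           # time-varying autoregressive weight
        y_var[t] += a_ts * V[s]

print(np.allclose(y_linear, y_var))  # True
```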
Published in NeurIPS 2025 (Spotlight), 2025
Introduces Zero‑Sum Linear Attention (ZeroS), which removes the uniform zero‑order term and reweights residuals to enable stable positive/negative attention weights, allowing contrastive operations within a single layer while retaining O(N) complexity.
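Purely as an illustration of the general idea of zero-sum, signed attention weights at O(N) cost, and not ZeroS itself, the sketch below centers the raw scores so the weights over the context sum to zero, using running sums; every name and the centering scheme are my assumptions.

```python
# Illustration only: zero-sum (signed) attention weights via mean-centering
# of scores, computed with O(N) running sums. Not the ZeroS construction.
import numpy as np

rng = np.random.default_rng(0)
T, d = 8, 4
Q, K, V = (rng.normal(size=(T, d)) for _ in range(3))

S_kv = np.zeros((d, d))   # running sum of k_s v_s^T
k_sum = np.zeros(d)       # running sum of k_s
v_sum = np.zeros(d)       # running sum of v_s
out = np.zeros((T, d))
for t in range(T):
    S_kv += np.outer(K[t], V[t])
    k_sum += K[t]
    v_sum += V[t]
    n = t + 1
    # y_t = sum_s (q_t.k_s) v_s - mean_s(q_t.k_s) * sum_s v_s
    out[t] = Q[t] @ S_kv - (Q[t] @ k_sum / n) * v_sum

# weights w_{t,s} = q_t.k_s - mean_s(q_t.k_s) sum to zero by construction
t = 5
w = Q[t] @ K[: t + 1].T
w -= w.mean()
print(np.isclose(w.sum(), 0.0), np.allclose(out[t], w @ V[: t + 1]))  # True True
```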
Published in ICLR 2026, 2026
Introduces Free Energy Mixer (FEM), which interprets (q,k) attention scores as a prior and performs a log-sum-exp free-energy readout to reweight values at the channel level, enabling a smooth transition from mean aggregation to selective channel-wise retrieval without increasing asymptotic complexity.
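The "mean to selective retrieval" behavior can be seen in a generic channel-wise free-energy readout: treating the softmax of the (q,k) scores as a prior p over positions, each value channel c is read out as F_c = (1/β) log Σ_s p_s exp(β v_{s,c}), which recovers the mean under p as β → 0 and approaches a per-channel max for large β. The sketch below is my illustration of that identity, not the FEM layer; the temperature β and all names are assumptions.

```python
# Channel-wise log-sum-exp (free-energy) readout over values, with the
# softmax of q.k scores as the prior (illustrative sketch only).
import numpy as np
from scipy.special import logsumexp

rng = np.random.default_rng(0)
T, d = 16, 8
q = rng.normal(size=d)
K, V = rng.normal(size=(T, d)), rng.normal(size=(T, d))

def free_energy_readout(q, K, V, beta):
    log_p = (K @ q) - logsumexp(K @ q)            # log prior from (q, k) scores
    # per-channel free energy: (1/beta) * log sum_s p_s * exp(beta * v_{s,c})
    return logsumexp(log_p[:, None] + beta * V, axis=0) / beta

p = np.exp((K @ q) - logsumexp(K @ q))
print(np.allclose(free_energy_readout(q, K, V, 1e-4), p @ V, atol=1e-3))       # ~ mean under p
print(np.allclose(free_energy_readout(q, K, V, 200.0), V.max(axis=0), atol=0.15))  # ~ channel-wise max
```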
Published in ICML 2026, 2026
Introduces adaptive time series forecasting via symplectic attention, developed through mentored undergraduate research with Jiecheng Lu as corresponding author.
Published in ICML 2026, 2026
Presents an integrated dynamic-MLP perspective on sequence modeling, reinterpreting attention heads through context-instantiated MLP computation and learnable sequence-space mixing.
PhD seminar talk at Georgia Tech ISyE on scaling laws, expressivity-efficiency tradeoffs, and the role of architecture in sequence modeling.
Invited online talk hosted by Tsinghua University on HyperMLP and an integrated view of sequence modeling.
ML PhD seminar talk at Georgia Tech on scaling laws, expressivity-efficiency tradeoffs, and the role of architecture in sequence modeling.
Independent Instructor, Georgia Institute of Technology, 2026
Independent instructor for ISyE 4031 Regression and Forecasting at Georgia Tech in Summer 2026.