PhD UT Austin
Contact: tushaarg@cs.utexas.edu
(by email), or
4.802C GDC in person
I am a PhD student at UT Austin, advised by Qiang Liu and Atlas Wang. I work on efficient large language models using alternate attention mechanisms.
(Previously,) at Cornell, I was advised by Sasha Rush and Cristian Danescu-Niculescu-Mizil.
Research interests. NLP
:: LLM ::
alternate-attention (efficiency, reasoning)
(See my papers page for full list of
publications.)
Recognition.
Teaching, internships, etc.
| Sp26 | Research Intern, Microsoft Research (Project: Efficient long-context modeling) |
| Su25 | Research Intern, Test-Time Scaling, IBM (Project: Alternate-attention models for long-COT reasoning) |
| Sp25 | Instructor (w/ Karthik Sridharan), Intro to Machine Learning (CS 3780/5780), Cornell |
| Fa24 | Head TA, Practicum in AI (CS 4701), Cornell |
| Su24 | Research Intern, Cornell (Project: Mechanistic interpretability of hybrid attention/RNN models; Advisor: Sasha Rush) |
| Sp24 | Head TA, Language and Information (CS 4300/INFO 4300), Cornell |
| Fa23 | Head TA, Natural Language Processing (CS 4740/LING 4744/COGST 4740/CS 5740), Cornell |
| Su23 | Research Intern, Cornell (Project: Authorship identification in cross-domain settings; Advisor: Sasha Rush) |
| Sp23 | Head TA, Language and Information (CS 4300/INFO 4300), Cornell |
| Fa22 | Head TA, Natural Language Processing (CS 4740/LING 4744/COGST 4740/CS 5740), Cornell |
Misc.