LLMs
-
The Curious Case of LLM Evaluations
Our modeling, scaling, and generalization techniques have grown faster than our benchmarking abilities, which in turn has resulted in poor evaluations and hyped capabilities.
-
No, GPT4 (RLHF’ed) Does Not Pass The Sally-Anne False Belief Test
Discussing the prospect of deriving intent and purpose from a prompt, creating example evaluation problems focused on the Sally-Anne False-Belief Test, and summarizing when GPT4 and GPT3.5 pass or fail the test.
-
I am Stuck in a Loop of Datasets ↔ Techniques
I keep jumping between "I do not trust the evaluation, the data is poor" and "the dataset I created only has 100 samples."
-
Add packages to ChatGPT code interpreter environment
-
A Personal Test Suite for LLMs
Most LLM benchmarks are either academic or do not capture what I actually use LLMs for. So, inspired by a few other people, this is my own test suite.
-
Random Research Ideas On Social Media That I Liked
Sometimes I come across random research ideas in the Twitter and broader social media universe that really resonate with me in the moment. They often get lost in doomscrolling, so I am compiling them into a running log.
-
Biases in ML
-
LLM Powered Literature Reviews
-
Demonstrating Gender Bias in GPT4
A brief demonstration of gender bias in GPT4, as observed across various downstream tasks, ft. Taylor Swift.
-
CAPSTONE: Capability Assessment Protocol for Systematic Testing of Natural Language Models' Expertise
Enhancing classification with a text annotation framework for improved systematization of prompt-based language model evaluation.