Posts by Year

2023

1 Year of a Challenging Big-Bench Task

16 minute read

In 2021 I contributed to the Big-Bench suite of NLP tasks, aiming to probe the abilities of large language models. Inspired by sports, I developed a task aim...

Back to Top ↑

2022

PALM

2 minute read

Introduction

SPANBERT

less than 1 minute read

What is the name of the SpanBERT paper? SpanBERT: Improving Pre-training by Representing and Predicting Spans Mandar Joshi, ...

TransformerXL

2 minute read

Introduction Transformer models typically have a fixed context window that is hard to scale due to the $O(n^2)$ cost of the attention mechanism. Extending th...

Back to Top ↑

2021

Back to Top ↑

2020

XGboost Part 1: Gradient Boosting

4 minute read

Introduction Xgboost is a powerful yet simple algorithm that has achieved state of the art results on tabular datasets. The Xgboost algorithm uses an ensembl...

Back to Top ↑