Siddharth Sharma – Medium

Siddharth Sharma

Siddharth Sharma

AWS Blog In Collaboration With Nvidia — Optimizing Inference For Seq2Seq And Encoder Only Models…

Posted on November 22, 2023 by Siddharth Sharma

1 min readNov 22, 2023

--

--

Siddharth Sharma

Compressing LLMs With Low Rank Decomposition Of Attention Matrices

Colab Link To Reproduce Experiment: LLM Compression Via Low Rank Decomposition.ipynb

5 min readNov 22, 2023

--

Compressing LLMs With Low Rank Decomposition Of Attention Matrices

--

Siddharth Sharma

Summary Of Adapter Based Performance Efficient Fine Tuning (PEFT) Techniques For Large Language…

The two most common transfer learning techniques in NLP were feature-based transfer (generating input text embedding from a pre-trained…

5 min readApr 21, 2023

--

Summary Of Adapter Based Performance Efficient Fine Tuning (PEFT) Techniques For Large Language…

--

Siddharth Sharma

Neural Ranking Architectures

Glimpses On Implicit/Explicit, Dense/Sparse, Gated/Non Gated, Low Rank And Many More Layered Interactions

8 min readJan 19, 2023

--

Neural Ranking Architectures

--

Siddharth Sharma

Anatomy Of A Model Inference Service

Context :

8 min readJan 14, 2023

--

Anatomy Of A Model Inference Service

--

Siddharth Sharma

Feature Fusion For The Uninitiated

Consider a typical e-commerce product. It would have a variety of content specific features like product title, brand, thumbnail etc and…

7 min readJan 13, 2023

--

Feature Fusion For The Uninitiated

--

Siddharth Sharma

Search Query Understanding

Introduction:

12 min readJan 25, 2021

--

Search Query Understanding

--

Siddharth Sharma

Of Bandits And Bidding

Real-time bidding(RTB) refers to the buying and selling of online ad impressions through real-time auctions that occur in the time it takes…

14 min readMay 9, 2016

--

2

Of Bandits And Bidding

--

2

Siddharth Sharma

Siddharth Sharma

Machine Learning Tech Lead Amazon https://www.linkedin.com/in/siddharth-sharma-31140210/

Following

Help
Status
About
Careers
Blog
Privacy
Terms
Text to speech
Teams