LLMs for Engineers
Subscribe
Sign in
Home
Archive
About
ai
LLMs Know More Than What They Say
... and how that provides winning evals
Aug 15
•
Ruby Pai
11
Share this post
LLMs Know More Than What They Say
arjunbansal.substack.com
Copy link
Facebook
Email
Note
Other
Hybrid Evaluation: Scaling human feedback with custom evaluation models
...how to really get model based evals to work for you
Nov 15, 2023
•
Ansup Babu
and
Arjun Bansal
8
Share this post
Hybrid Evaluation: Scaling human feedback with custom evaluation models
arjunbansal.substack.com
Copy link
Facebook
Email
Note
Other
Which Llama-2 Inference API should I use?
understanding the complete trade-offs of Llama-2 providers
Oct 31, 2023
•
Wenzhe Xue
2
Share this post
Which Llama-2 Inference API should I use?
arjunbansal.substack.com
Copy link
Facebook
Email
Note
Other
Ready, Set, Test: Building Evaluation into Your LLM Workflow
... with llmeval
Oct 13, 2023
•
Niklas Nielsen
Share this post
Ready, Set, Test: Building Evaluation into Your LLM Workflow
arjunbansal.substack.com
Copy link
Facebook
Email
Note
Other
How do I evaluate LLM coding agents? 🧑💻
...aka when can I hire an AI software engineer?
Aug 31, 2023
•
Arjun Bansal
1
Share this post
How do I evaluate LLM coding agents? 🧑💻
arjunbansal.substack.com
Copy link
Facebook
Email
Note
Other
🕵️🗺️ Where do I deploy Llama-2? 🦙🦙
We share the most cost efficient way to run Llama-2
Aug 22, 2023
•
Arjun Bansal
3
Share this post
🕵️🗺️ Where do I deploy Llama-2? 🦙🦙
arjunbansal.substack.com
Copy link
Facebook
Email
Note
Other
Llama-2 and the open source LLM 🌊
Anyone can own and run full stack LLM applications like never before
Aug 3, 2023
•
Arjun Bansal
1
Share this post
Llama-2 and the open source LLM 🌊
arjunbansal.substack.com
Copy link
Facebook
Email
Note
Other
Evaluating LLM Agents and Applications
A lot of AI research such as HELM and BigBench has been devoted to building test suites to evaluate the accuracy of large language models.
Jul 11, 2023
•
Arjun Bansal
4
Share this post
Evaluating LLM Agents and Applications
arjunbansal.substack.com
Copy link
Facebook
Email
Note
Other
Evolution of LLM Agents
...and how to avert a crisis on further progress!
Jun 21, 2023
•
Arjun Bansal
and
Niklas Nielsen
4
Share this post
Evolution of LLM Agents
arjunbansal.substack.com
Copy link
Facebook
Email
Note
Other
3 ways to improve LLM Agent chains with debugging
Tl;dr: Cost, reliability & accuracy
May 3, 2023
•
Arjun Bansal
3
Share this post
3 ways to improve LLM Agent chains with debugging
arjunbansal.substack.com
Copy link
Facebook
Email
Note
Other
3
Share
Copy link
Facebook
Email
Note
Other
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts