LLMs for Engineers
Subscribe
Sign in
Home
Archive
About
evaluation
LLMs Know More Than What They Say
... and how that provides winning evals
Aug 15
•
Ruby Pai
11
Share this post
LLMs Know More Than What They Say
arjunbansal.substack.com
Copy link
Facebook
Email
Note
Other
Which Llama-2 Inference API should I use?
understanding the complete trade-offs of Llama-2 providers
Oct 31, 2023
•
Wenzhe Xue
2
Share this post
Which Llama-2 Inference API should I use?
arjunbansal.substack.com
Copy link
Facebook
Email
Note
Other
How do I evaluate LLM coding agents? 🧑💻
...aka when can I hire an AI software engineer?
Aug 31, 2023
•
Arjun Bansal
1
Share this post
How do I evaluate LLM coding agents? 🧑💻
arjunbansal.substack.com
Copy link
Facebook
Email
Note
Other
Llama-2 and the open source LLM 🌊
Anyone can own and run full stack LLM applications like never before
Aug 3, 2023
•
Arjun Bansal
1
Share this post
Llama-2 and the open source LLM 🌊
arjunbansal.substack.com
Copy link
Facebook
Email
Note
Other
Evaluating LLM Agents and Applications
A lot of AI research such as HELM and BigBench has been devoted to building test suites to evaluate the accuracy of large language models.
Jul 11, 2023
•
Arjun Bansal
4
Share this post
Evaluating LLM Agents and Applications
arjunbansal.substack.com
Copy link
Facebook
Email
Note
Other
Evolution of LLM Agents
...and how to avert a crisis on further progress!
Jun 21, 2023
•
Arjun Bansal
and
Niklas Nielsen
4
Share this post
Evolution of LLM Agents
arjunbansal.substack.com
Copy link
Facebook
Email
Note
Other
Share
Copy link
Facebook
Email
Note
Other
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts