LLMs for Engineers
Low-Budget Judge for High-End Hallucination Verdicts
… boosting LLM accuracy by >5% amidst label scarcity and budget constraints.
Nov 21
•
Daniel Omeiza
August 2024
LLMs Know More Than What They Say
... and how that provides winning evals
Aug 15
•
Ruby Pai
June 2024
Pytest is All You Need
… for LLM evaluation when you have reference data and metrics.
Jun 12
•
Wenzhe Xue
January 2024
Scaling human feedback with fine-tuned open-source LLMs
Discover how Llama and Mistral can improve accuracy at scale
Jan 30
•
Wenzhe Xue
November 2023
Hybrid Evaluation: Scaling human feedback with custom evaluation models
...how to really get model-based evals to work for you
Nov 15, 2023
•
Ansup Babu
and
Arjun Bansal
October 2023
Which Llama-2 Inference API should I use?
Understanding the complete trade-offs of Llama-2 providers
Oct 31, 2023
•
Wenzhe Xue
Ready, Set, Test: Building Evaluation into Your LLM Workflow
... with llmeval
Oct 13, 2023
•
Niklas Nielsen
August 2023
How do I evaluate LLM coding agents? 🧑‍💻
...aka when can I hire an AI software engineer?
Aug 31, 2023
•
Arjun Bansal
🕵️🗺️ Where do I deploy Llama-2? 🦙🦙
We share the most cost-efficient way to run Llama-2
Aug 22, 2023
•
Arjun Bansal
Llama-2 and the open source LLM 🌊
Anyone can own and run full stack LLM applications like never before
Aug 3, 2023
•
Arjun Bansal
July 2023
Evaluating LLM Agents and Applications
A lot of AI research such as HELM and BigBench has been devoted to building test suites to evaluate the accuracy of large language models.
Jul 11, 2023
•
Arjun Bansal
June 2023
Evolution of LLM Agents
...and how to avert a crisis on further progress!
Jun 21, 2023
•
Arjun Bansal
and
Niklas Nielsen