LLMs for Engineers
Putting AI-Generated Clinical Text to the Test: What Experts Found
…healthcare applications demand uncompromising accuracy, and innovation is key.
Dec 12, 2024 • Suchismita Padhy
November 2024
Low-Budget Judge for High-End Hallucination Verdicts
… boosting LLM accuracy by >5% amidst label scarcity and budget constraints.
Nov 21, 2024 • Daniel Omeiza
August 2024
LLMs Know More Than What They Say
... and how that provides winning evals
Aug 15, 2024 • Ruby Pai
June 2024
Pytest is All You Need
… for LLM evaluation when you have reference data and metrics.
Jun 12, 2024 • Wenzhe Xue
January 2024
Scaling human feedback with fine-tuned open-source LLMs
Discover how Llama and Mistral can improve accuracy at scale
Jan 30, 2024 • Wenzhe Xue
November 2023
Hybrid Evaluation: Scaling human feedback with custom evaluation models
...how to really get model based evals to work for you
Nov 15, 2023 • Ansup Babu and Arjun Bansal
October 2023
Which Llama-2 Inference API should I use?
understanding the complete trade-offs of Llama-2 providers
Oct 31, 2023 • Wenzhe Xue
Ready, Set, Test: Building Evaluation into Your LLM Workflow
... with llmeval
Oct 13, 2023 • Niklas Nielsen
August 2023
How do I evaluate LLM coding agents? 🧑‍💻
...aka when can I hire an AI software engineer?
Aug 31, 2023 • Arjun Bansal
🕵️🗺️ Where do I deploy Llama-2? 🦙🦙
We share the most cost-efficient way to run Llama-2
Aug 22, 2023 • Arjun Bansal
Llama-2 and the open source LLM 🌊
Anyone can own and run full stack LLM applications like never before
Aug 3, 2023 • Arjun Bansal
July 2023
Evaluating LLM Agents and Applications
A lot of AI research such as HELM and BigBench has been devoted to building test suites to evaluate the accuracy of large language models.
Jul 11, 2023 • Arjun Bansal