Tất cả prompt

Infographic & biểu đồ

langchain and fireworks just shipped the eval move worth ste

Real prompt shared by @rohit4verse on X

langchain and fireworks just shipped the eval move worth ste

Prompt

langchain and fireworks just shipped the eval move worth stealing: a fine-tuned qwen judge that flags "perceived error" on every production trace and runs up to 100x cheaper than opus. the cost number gets the attention. the transfer result matters more. they trained the judge on one app, their docs q&a agent. then they pointed it at fleet, a separate product, with no retraining. it beat every frontier model on that domain. 90.8% against opus at 90.2%. most evaluators break the second you move them to a new app, because the rubric is app-specific. "perceived error" travels because the signal is behavioral: the user corrects you, or repeats the request. that pattern holds across every product. one design choice stands out. they fed the judge human and ai messages only and dropped every tool call. their bet is that the correction signal lives in the conversation itself. anyone can rent the model in your loop. a judge trained on your own traces, cheap enough to run on all of them, is the moat they cannot buy.

Share

Originally by

Rohit

@rohit4verse · on X

Jun 15, 2026EN

View original source

113likes17reposts15replies30.2Kviews169bookmarksas of Jun 19, 2026

Referenced with attribution — all rights remain with the original creator. Sources & removal

Model: GPT Image 2
Aspect ratio: 1:1
Danh mục: Infographic & biểu đồ

Biên tậpGiáo dụcTối giảnThiết kế phẳngChữTrừu tượng

Tạo phiên bản của riêng bạn

1Sao chép prompt
2Thay bằng chủ thể và chi tiết của riêng bạn
3Tạo ảnh

Công thức liên quan

View all prompts

Based on { TOPIC }

Infographic & biểu đồ

Based on { TOPIC }

Real prompt shared by @oggii_0 on X

Most people talk about Agentic AI.

Infographic & biểu đồ

Most people talk about Agentic AI.

Real prompt shared by @MeenakshiYACS on X

Modern House infographics created using GPT Image 2 on ChatG

Infographic & biểu đồ

Modern House infographics created using GPT Image 2 on ChatG

Real prompt shared by @mehvishs25 on X

Create a premium female beauty & makeup analysis infographic

Infographic & biểu đồ

Create a premium female beauty & makeup analysis infographic

Real prompt shared by @saniaspeaks_ on X

I uploaded one selfie.

Infographic & biểu đồ

I uploaded one selfie.

Real prompt shared by @vishisinghal_ on X

A wide two-column comparison chart with bold headers, aligned rows, and a center divider.

Infographic & biểu đồ

Side-by-side comparison chart

Compare two options in a balanced, scannable layout.

GPT Image 2 on ChatGPT

Infographic & biểu đồ

GPT Image 2 on ChatGPT

Real prompt shared by @john_my07 on X

gpt-image-2

Infographic & biểu đồ

gpt-image-2

Real prompt shared by @rohanpaul_ai on X