Conversation

@MichaelDoyle (Member) commented Feb 25, 2025

Fixes #2064

[screenshots: trace view]

@MichaelDoyle requested a review from pavelgj on February 25, 2025 03:23
github-actions bot added the js label on Feb 25, 2025
@pavelgj (Collaborator) commented Feb 25, 2025

Can you share the code snippet that produces the trace in the screenshot?

@MichaelDoyle (Member, Author) commented Feb 25, 2025

hello.history.prompt

---
model: googleai/gemini-1.5-flash
config:
  maxOutputTokens: 2048
  temperature: 0.6
  topK: 16
  topP: 0.95
  safetySettings:
    - category: HARM_CATEGORY_HATE_SPEECH
      threshold: BLOCK_ONLY_HIGH
    - category: HARM_CATEGORY_DANGEROUS_CONTENT
      threshold: BLOCK_ONLY_HIGH
    - category: HARM_CATEGORY_HARASSMENT
      threshold: BLOCK_ONLY_HIGH
    - category: HARM_CATEGORY_SEXUALLY_EXPLICIT
      threshold: BLOCK_ONLY_HIGH
input:
  schema:
    name: string
    persona?: string
  default:
    persona: Space Pirate
---

{{role "system"}}
You are a helpful AI assistant that really loves to make impressions.
{{role "user"}}
Say hello to Michael in the voice of a Space Pirate.
{{role "model"}}
Shiver me timbers, matey! We be sailing the solar winds!
{{role "user"}}
Say hello to {{name}} in the voice of a {{persona}}.

flowDotPromptHistory

import { googleAI } from '@genkit-ai/googleai';
import { genkit, z } from 'genkit';

const ai = genkit({ plugins: [googleAI()] });

export const HelloSchema = z.object({
  name: z.string(),
  persona: z.string().optional(),
});

export const flowDotPromptHistory = ai.defineFlow(
  {
    name: 'flowDotPromptHistory',
    inputSchema: HelloSchema,
    outputSchema: z.any(),
  },
  async (input) => {
    // Loads the 'history' variant, i.e. hello.history.prompt.
    const hello = ai.prompt('hello', { variant: 'history' });
    return (await hello(input)).text;
  }
);
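For reference, a minimal way to invoke the flow once it is defined (the input value here is illustrative):

// Flows returned by ai.defineFlow are directly callable; the input is
// validated against HelloSchema.
const greeting = await flowDotPromptHistory({ name: 'Pavel' });
console.log(greeting);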
@schnecle (Contributor) left a comment

This will be hugely helpful for understanding when rendering issues are causing problems. Thanks Mike!

@pavelgj (Collaborator) commented Feb 26, 2025

So, what I'm thinking is that perhaps the traces should look something like this:

hello.history
  |- render
  |- generate
    |- googleai/gemini-1.5-flash

Or do you think it would get too verbose? An extra span with redundant info?
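To make the shape concrete, a rough sketch of the nesting (span names match the tree above; the runInNewSpan options and the renderPrompt/generate helpers here are simplified placeholders, not the exact internal signatures):

// Sketch only: the prompt action wraps rendering and generation in
// nested spans, yielding the trace tree shown above.
return runInNewSpan(registry, { metadata: { name: 'hello.history' } }, async () => {
  const request = await runInNewSpan(
    registry,
    { metadata: { name: 'render' } },
    () => renderPrompt(input) // renders the template into a request
  );
  return runInNewSpan(
    registry,
    { metadata: { name: 'generate' } },
    () => generate(registry, request) // generate opens the model span itself
  );
});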

Semi-related: I was also chatting with Shruti yesterday... I think we should change render to return GenerateActionOptions instead of GenerateRequest.
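In other words, a type-level sketch of the proposed change (signatures abbreviated):

// Current (roughly):  render(input?: z.infer<I>): Promise<GenerateRequest>
// Proposed:           render(input?: z.infer<I>): Promise<GenerateActionOptions>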

@MichaelDoyle (Member, Author) commented:

I think that's the right approach. Been swept up in some other things, but will try and knock this out today.

@MichaelDoyle (Member, Author) commented:

@pavelgj I got this going for the generate case. I wasn't exactly sure what to do for prompt.stream(), though, since generateStreaming is sync while runInNewSpan returns a Promise.

@MichaelDoyle requested a review from schnecle on March 3, 2025 03:59
@pavelgj (Collaborator) commented Mar 11, 2025

> @pavelgj I got this going for the generate case. I wasn't exactly sure what to do for prompt.stream(), though, since generateStreaming is sync while runInNewSpan returns a Promise.

I swear I replied to this.... sigh....

One thing I can think of is to use the Channel approach used in generateStream to wrap it manually:

let channel = new Channel<GenerateResponseChunk>();
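Expanded into a rough sketch (assuming Channel behaves as it does inside generateStream, with send/close/error, and that the span options and renderedOptions shown here are placeholders):

// Sketch only: pump chunks through a Channel so the synchronous streaming
// API can sit on top of the Promise returned by runInNewSpan.
const channel = new Channel<GenerateResponseChunk>();

const response = runInNewSpan(
  registry,
  { metadata: { name: 'generate' } },
  () =>
    generate(registry, {
      ...renderedOptions,
      onChunk: (chunk) => channel.send(chunk),
    })
);
// Settle the channel when the wrapped generate call completes or fails.
response.then(
  () => channel.close(),
  (err) => channel.error(err)
);

return { stream: channel, response };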

@schnecle (Contributor) left a comment

We might need to add to our action types in the trace and log handling in the plugin. We might still omit the input/output for this one. That can definitely be a follow-up item, though.

@MichaelDoyle (Member, Author) commented:

@schnecle same applies to the generate util as well, correct? Maybe we should do a separate pass on metrics; I think there's a couple of things there.

@MichaelDoyle force-pushed the prompt-spans branch 3 times, most recently from c2fa5de to eec35bc on June 4, 2025 03:34
@MichaelDoyle merged commit 7df3fa9 into main on June 4, 2025; 5 checks passed
@MichaelDoyle deleted the prompt-spans branch on June 4, 2025 03:44