
Conversation

@adtyavrdhn (Contributor) commented Jun 18, 2025

Fixes #1571

Adds an include_content flag to InstrumentationSettings, default set to True

Adds include_tool_args to remove the arguments from the 'running tool' spans when include_content is set to False

Adds an example to the logfire docs
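
As a rough illustration of the intended usage, here is a minimal sketch of instrumenting an agent with content excluded; the import path, model name, and logfire setup are assumptions for this sketch rather than part of the diff (the documented example lives in docs/logfire.md):

    import logfire
    from pydantic_ai.agent import Agent, InstrumentationSettings

    logfire.configure()

    # Assumed behaviour per this PR: with include_content=False, prompts,
    # completions, and tool call arguments are omitted from the emitted
    # spans/events, while the events themselves are still recorded.
    settings = InstrumentationSettings(include_content=False)

    # Instrument a single agent...
    agent = Agent('openai:gpt-4o', instrument=settings)

    # ...or instrument every agent at once.
    Agent.instrument_all(settings)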

hyperlint-ai bot commented Jun 18, 2025

PR Change Summary

Introduced a new flag to manage sensitive content in instrumentation settings and updated documentation accordingly.

  • Added include_content flag in InstrumentationSettings, default set to True
  • Implemented guard clauses to scrub sensitive content while retaining the event
  • Updated logfire documentation with an example for excluding sensitive content

Modified Files

  • docs/logfire.md


@alexmojaki (Contributor) left a comment:

Tool call arguments also need to be omitted.

@alexmojaki (Contributor) commented:

The tool call arguments also need to be excluded from the 'running tool' span.

@alexmojaki (Contributor) left a comment:

Reminder: The tool call arguments also need to be excluded from the 'running tool' span.

Comment on lines 692 to +695:

      if body.get('content'):
          body = new_event_body()
    - body['content'] = part.content
    + if settings.include_content:
    +     body['content'] = part.content
@alexmojaki (Contributor) commented:

This implies that the number of events produced will depend on whether content is included if multiple tool calls are made in a single message.

@adtyavrdhn (Contributor, author) replied:

I'm trying to understand exactly what you mean. Would you say the test below captures what you're implying?


def test_otel_events_consistency_with_include_content():
    """Test that the number of OpenTelemetry events is consistent regardless of include_content setting."""

    # Create a response with multiple tool calls followed by text
    response = ModelResponse(parts=[
        ToolCallPart('tool1', {'arg1': 'value1'}, 'call_1'),
        ToolCallPart('tool2', {'arg2': 'value2'}, 'call_2'),
        TextPart('Some text response')
    ])

    settings_with_content = InstrumentationSettings(include_content=True)
    events_with_content = response.otel_events(settings_with_content)

    settings_without_content = InstrumentationSettings(include_content=False)
    events_without_content = response.otel_events(settings_without_content)

    assert len(events_with_content) == len(events_without_content), (
        f"Event count differs: with_content={len(events_with_content)}, "
        f"without_content={len(events_without_content)}"
    )

@alexmojaki (Contributor) commented Jun 23, 2025:

Sorry, I didn't think this through properly. The difference would be if a message had multiple text parts, but I don't think this happens in practice, so it isn't worth worrying about.

Another participant commented:

> The difference would be if a message had multiple text parts, but I don't think this happens in practice

I'm not caught up on this whole conversation, but I do think a message with multiple text parts could be realistic. For example, multiple text parts could be used to separate different inputs:

  • text part: compare these two paragraphs, which is better?
  • text part: paragraph 1
  • text part: paragraph 2

Of course this could be done by concatenating all text into a single part.

@adtyavrdhn (Contributor, author) replied:

Yep, I get it. Something like this would cause the issue:

    response = ModelResponse(parts=[
        TextPart('Some text response'),
        ToolCallPart('tool2', {'arg2': 'value2'}, 'call_2'),
        TextPart('Some more text response'),
        ToolCallPart('tool1', {'arg1': 'value1'}, 'call_1'),
        TextPart('Even more text response'),
    ])

@adtyavrdhn requested a review from alexmojaki on June 24, 2025 at 13:43
@alexmojaki (Contributor) commented:

Thanks!

@alexmojaki merged commit cbe0a92 into pydantic:main on Jun 25, 2025
19 checks passed
@adtyavrdhn deleted the scrubbing_sensitive_content branch on June 25, 2025 at 13:54
Development

Successfully merging this pull request may close these issues.

Add a setting to remove prompts and completions from tracing
4 participants