@mantrakp04 mantrakp04 commented Mar 11, 2025

  • Extend multi-modal content handling to support PDF, audio, and video uploads
  • Add new content types for documents and media in interfaces
  • Update Anthropic and Google Generative AI chat models to handle additional file types
  • Refactor multi-modal utility functions to support broader content processing
  • Improve flexibility for different LLM models with multi-modal content
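
For illustration only, the new content types could look something like the sketch below. The type names, field names, and the `toContentPart` helper are hypothetical and are not taken from this PR's diff. The sketch assumes uploads arrive as base64 strings with a MIME type.

```typescript
// Hypothetical content-part types; names are illustrative, not from the PR.
type TextPart = { type: 'text'; text: string }
type ImagePart = { type: 'image'; mimeType: string; data: string }
type DocumentPart = { type: 'document'; mimeType: 'application/pdf'; data: string }
type MediaPart = { type: 'media'; mimeType: string; data: string }
type ContentPart = TextPart | ImagePart | DocumentPart | MediaPart

// Map an upload's MIME type to a content part (data is assumed to be base64).
function toContentPart(mimeType: string, data: string): ContentPart {
    if (mimeType === 'application/pdf') return { type: 'document', mimeType, data }
    if (mimeType.startsWith('image/')) return { type: 'image', mimeType, data }
    if (mimeType.startsWith('audio/') || mimeType.startsWith('video/')) {
        return { type: 'media', mimeType, data }
    }
    // Fail loudly so unsupported uploads are not silently dropped.
    throw new Error(`Unsupported upload type: ${mimeType}`)
}
```

A discriminated union on `type` would let each chat model wrapper switch over the parts it supports and reject the rest explicitly.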

Discord post: https://discord.com/channels/1087698854775881778/1349020605197848690

@mantrakp04 mantrakp04 marked this pull request as draft March 11, 2025 15:12
jquinter commented Apr 2, 2025

I think this PR would be very useful to integrate!

+1

@marcosmarf27

+1
