Pixtral 12B

Pixtral 12B

by

Mistral

Model ID:

pixtral-12b

Use This Model

Frequently Asked Questions

  1. What are the key features?
    It's a powerful vision-language model capable of understanding and generating text and images, excelling in multimodal tasks.

  2. How does it handle multimodal tasks?
    It effectively combines visual and textual information for tasks like image captioning, visual question answering, and image generation.

  3. What is the context window size?
    The specific context window size is not publicly disclosed.

  4. How does it compare to other vision-language models?
    It's considered competitive in its size range, offering strong performance in vision-language tasks and representing state-of-the-art capabilities.

Still have questions?

Cant find the answer you’re looking for? Please chat to our friendly team.

Get In Touch

Model Specifications

Release Date:

17/9/2024

Max. Output Tokens:

4K

License:

Open-Source

Technical Report/Model Card:

© 2024 Portkey, Inc. All rights reserved

Pixtral 12B

Pixtral 12B

by

Mistral

Model ID:

pixtral-12b

Chat

Vision