Passing Images and Files

Many language models can process more than just text—they can analyze images, PDFs, and other files. The AI SDK provides built-in support for sending files to your LLM provider's API.

Right now, your chat application has a file upload button on the frontend, but it doesn't actually do anything with the file. The backend is expecting only text messages, so uploading an image won't work.

You need to modify the form submission handler to capture the uploaded file and send it along with the user's text message to the LLM.

Steps To Complete

Convert the File to a Data URL

Look at the fileToDataURL helper function already provided in your code

This function converts a File object from the form into a string that the AI SDK can send over the network.

const fileToDataURL = (file: File) => {
  return new Promise<string>((resolve, reject) => {
    const reader = new FileReader();
    reader.onload = () => resolve(reader.result as string);
    reader.onerror = reject;
    reader.

Update the Form Submission Handler

Modify the onSubmit callback in the ChatInput component

Instead of passing only text to sendMessage(), you need to pass a parts array that includes both a text part and an optional file part.

onSubmit={async (e) => {
  e.preventDefault();

  const formData = new FormData(
    e.target as HTMLFormElement,
  );
  const file = formData.get('file') as

Look at the message parts documentation to understand the structure you need to create.

Handle the Optional File Part

Create a file part object only if a file exists

Use the fileToDataURL function to convert the file to a data URL string. Include the file's mediaType property so the LLM knows what kind of file it is.

Update the sendMessage() call to use a parts array

The parts array should always include a text part with the user's input. If a file was selected, add a file part to the array as well.

Look at the solution code to see what the FileUIPart type looks like.

Test Your Implementation

Run the development server with pnpm run dev

Open http://localhost:3000 in your browser.

Click the Upload File button and select the image.png file from the problem folder

This is an image of Lake Bled in Slovenia.

Type "Could you describe this image?" in the chat input
Submit the form and check that the LLM successfully analyzes the image

The model should describe what it sees in the image rather than failing silently.

Make sure you're using a model that supports image analysis

Gemini 2.5 Flash has this capability built in.

Many language models can process more than just text—they can analyze images, PDFs, and other files. The AI SDK provides built-in support for sending files to your LLM provider's API.

You need to modify the form submission handler to capture the uploaded file and send it along with the user's text message to the LLM.

Steps To Complete

Convert the File to a Data URL

Look at the fileToDataURL helper function already provided in your code

This function converts a File object from the form into a string that the AI SDK can send over the network.

const fileToDataURL = (file: File) => {
  return new Promise<string>((resolve, reject) => {
    const reader = new FileReader();
    reader.onload = () => resolve(reader.result as string);
    reader.onerror = reject;
    reader.

Update the Form Submission Handler

Modify the onSubmit callback in the ChatInput component

Instead of passing only text to sendMessage(), you need to pass a parts array that includes both a text part and an optional file part.

onSubmit={async (e) => {
  e.preventDefault();

  const formData = new FormData(
    e.target as HTMLFormElement,
  );
  const file = formData.get('file') as

Look at the message parts documentation to understand the structure you need to create.

Handle the Optional File Part

Create a file part object only if a file exists

Use the fileToDataURL function to convert the file to a data URL string. Include the file's mediaType property so the LLM knows what kind of file it is.

Update the sendMessage() call to use a parts array

The parts array should always include a text part with the user's input. If a file was selected, add a file part to the array as well.

Look at the solution code to see what the FileUIPart type looks like.

Test Your Implementation

Run the development server with pnpm run dev

Open http://localhost:3000 in your browser.

Click the Upload File button and select the image.png file from the problem folder

This is an image of Lake Bled in Slovenia.

Type "Could you describe this image?" in the chat input
Submit the form and check that the LLM successfully analyzes the image

The model should describe what it sees in the image rather than failing silently.

Make sure you're using a model that supports image analysis

Gemini 2.5 Flash has this capability built in.

AI SDK Basics

LLM Fundamentals

Agents

Persistence

Context Engineering

Evals

Streaming

Agents and Workflows

Advanced Patterns

Reference

Steps To Complete

Convert the File to a Data URL

Update the Form Submission Handler

Handle the Optional File Part

Test Your Implementation

Steps To Complete

Convert the File to a Data URL

Update the Form Submission Handler

Handle the Optional File Part

Test Your Implementation

01AI SDK Basics0/13

AI SDK Basics

02LLM Fundamentals0/5

LLM Fundamentals

03Agents0/7

Agents

04Persistence0/4

Persistence

05Context Engineering0/5

Context Engineering

06Evals0/7

Evals

07Streaming0/4

Streaming

08Agents and Workflows0/4

Agents and Workflows

09Advanced Patterns0/4

Advanced Patterns

10Reference0/9

Reference

Passing Images and Files

Video Transcript

Video Transcript