This is a cache of https://developer.ibm.com/tutorials/develop-voice-assistant-watsonx-orchestrate/. It is a snapshot of the page as it appeared on 2025-12-16T02:29:28.263+0000.
Developing a voice-first, multi-domain AI assistant using watsonx Orchestrate and Apple Siri - IBM Developer
Large language models rely on large training datasets, but those datasets have limits: they have a knowledge cutoff date and only include publicly available information. IBM watsonx Orchestrate addresses this issue by letting you bring your own private documents and inject real-time enterprise knowledge directly into the LLM.
In a busy workday, you're constantly jumping between tasks, asking questions about long custom documents, checking on your company’s stock trend, reviewing a GitHub repo, sending quick follow-up emails to your colleagues, or doing quick calculations on the fly.
Frequently, we use assistants like Apple’s Siri (or Amazon’s Alexa) to help us accomplish our tasks. However, most AI assistants still get in the way more than they help:
They struggle to support multiple modes (voice, text, documents, APIs)
They lose context when switching between different kinds of tasks
You frequently have to tap, type, or deal with manual prompts
They can’t follow or execute complex, multi-step workflows
Siri now integrates with IBM watsonx Orchestrate enabling developers to create a smooth, voice-driven assistant that can handle real work across multiple domains instantly and hands-free. Whether you're running between meetings or multitasking at your desk, this integration gives you a conversational way to get answers, automate steps, and stay focused on what actually matters.
In this tutorial, you will create a simple, voice-driven workflow where Siri captures the request, a watsonx Orchestrate agent retrieves the right enterprise knowledge using RAG, and the user gets instant answers or automated actions.
In this video, I demonstrate how a user interacts with Siri, which connects with watsonx Orchestrate to answer questions such as “What was IBM's total revenue in Q3 2025?” or “How many developers in IBM use Project Bob?”
Architecture: A voice-first, multi-domain AI agent ecosystem
This solution combines two powerful technologies:
Apple Shortcuts (and Siri) - For seamless iOS integration and voice activation
watsonx Orchestrate - For intelligent workflow orchestration
This is the sequence of steps followed in this sample use case:
The user communicates with their voice with Siri using natural language.
Siri triggers an Apple Shortcut that retrieves an IAM token for watsonx Orchestrate.
The token is used to call the watsonx Orchestrate /chat/completions endpoint.
watsonx Orchestrate performs retrieval, and reasoning, and response.
The Apple Shortcut speaks the answer back through Siri.
Prerequisites
IBM watsonx Orchestrate on IBM Cloud with IAM access. Sign up for a free trial using these steps in the watsonx Orchestrate docs.
An Apple ID signed in with Siri enabled.
MacBook admin access to modify and interact with Apple’s Siri AI tool.
Apple Shortcuts using an iPhone, a MacBook, or an iPad.
Steps
To create your voice-first, multi-domain AI agent ecosystem, complete these steps:
Create a RAG agent using watsonx Orchestrate.
Get the API Key for watsonx Orchestrate.
Create an Apple Shortcut to connect Siri to watsonx Orchestrate.
Test your voice assistant.
Step 1. Create a RAG agent using watsonx Orchestrate
Log in to IBM watsonx Orchestrate. Create a no-code agent (for detailed step-by-step instruction on creating no-code agent, refer to this tutorial) with the following name and description:
Name: Test RAG pipeline with Siri Description: This agent is built to connect Apple Siri with watsonx Orchestrate and your custom knowledge base.
In the Knowledge section, click Add source. Then click New knowledge, and then click Upload files.
Name: Apple Siri RAG Pipelines Description: The knowledge base includes the IBM earning reports, to demonstrate simple Apple Siri and watsonx Orchestrate integration for RAG use cases.
Try to chat with the agents. For example, "What was IBM's total revenue in Q3 2025?"
Step 2. Get the API Key for watsonx Orchestrate
After you tested the agent in the watsonx Orchestrate UI, you can import it into Apple Siri using the watsonx Orchestrate API.
To get the watsonx Orchestrate API details, go to Settings and click Generate API key. Then, copy your service instance URL, which contains your tenant’s ID.
Open your agent in the browser and copy the Agent ID from the end of the service instance URL. Your final URL endpoint format will be like this: https://api.us-south.watson-orchestrate.cloud.ibm.com/instances/<TENANT-ID>/v1/orchestrate/<AGENT-ID>/chat/completions. Replace us-south with your region.
Save this URL. You will need to paste it into the Apple Shortcut later in this tutorial.
Step 3. Create an Apple Shortcut to connect Siri to watsonx Orchestrate
A custom Apple Shortcut connects Siri to the watsonx Orchestrate API.
On your Apple device, add the Apple Shortcut “Ask watsonx” using this iCloud link, and then click Get Shortcut, then Open Shortcuts, and finally Add Shortcut.
Right-click Ask Watson-X, and then click Edit.
Expand the “Get Contents of https://iam.cloud.ibm.com/identity/token” drop-down, and replace the apikey value with the value of your watsonx Orchestrate API key from the previous step.
Take your service instance URL (which contains the Tenant ID) and append your Agent ID to form the complete chat completions endpoint. For example, in this URL https://api.us-south.watson-orchestrate.cloud.ibm.com/instances/<TENANT-ID>/v1/orchestrate/<AGENT-ID>/chat/completions. Replace and with your own values and replace us-south with your region, and use this final URL in the block after "Get content of" (see the following screen capture):
In the Settings app for your Apple device, go to Keyboard and enable the Dictation option (by default it's not enabled).
The first part of the Ask Watson-X shortcut should look something like this:
The second part should look something like this:
The shortcut performs eight actions in order:
Receive Trigger - Starts the shortcut through Siri or Quick Actions.
Dictate text – Listens and converts your speech into text.
POST Request (IAM Token) – Sends your API key to IBM’s IAM to get an access_token in realtime.
Extracts Token – Reads the access_token value from the IBM’s IAM response.
POST Request (watsonx Orchestrate Chat API) – Sends the dictated text to the watsonx Orchestrate /chat/completions endpoint using Authorization: Bearer <access_token>.
Extract Choices – Gets the choices array from the watsonx Orchestrate response.
Extract Message Content – Retrieves the assistant’s text reply.
Speak Response – Siri reads the final answer aloud.
Step 4. Test your voice assistant
Here are simple, practical test questions you can ask Siri to validate the agent using your PDFs and RAG agents that has been deployed. If you change or add more documents, you can customize these tests accordingly.
To test, say the following:
You: “Hey Siri, ask watsonx”
Siri responds: “What’s the text?”
You: “How many developers in IBM use Project Bob?”
Siri responds: “More than 8,000”
Extending the integration between Siri and watsonx Orchestrate with tools
You can extend the integration between Siri and watsonx Orchestrate further with these scenarios. The python code for these different tools is available on GitHub, in case you’d like to explore any of these tools.
Purpose: Execute Python code for calculations or quick evaluations.
Capabilities: Mathematical operations, custom logic execution, and lightweight data processing.
Safety: Runs in a controlled, read-only environment.
Typical Use Case: “Calculate compound interest on 10,000 at 5% for 10 years.”
To quickly import all agents, tools, connections, and knowledge bases, use this automated shell script in a terminal.
Summary and next steps
This tutorial links Apple Siri to watsonx Orchestrate to deliver a voice-first RAG assistant that pulls enterprise knowledge from your private documents and returns instant, hands-free answers or automated actions.
It combines Apple Shortcuts for voice capture, watsonx Orchestrate for knowledge routing and workflow orchestration, and a secure API/token flow so Siri can call the watsonx Orchestrate chat endpoint and speak back RAG-powered responses.
To extend this use case, you can integrate the watsonx orchestrate agents with other channels like WhatsApp, Slack and Facebook Messenger. See this GitHub repo for the detailed steps.
Acknowledgments
This tutorial was produced as part of the IBM Open Innovation Community initiative: Agentic AI (AI for Developers and Ecosystem).
The author deeply appreciates the support of Ela Dixit, Ahmed Azraq, and Michelle Corbin for the guidance on reviewing and contributing to this tutorial.
About cookies on this siteOur websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising.For more information, please review your cookie preferences options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.