Logo

Voice

How SystemPrompt's voice capabilities leverage MCP for natural interactions

Voice Control with MCP

SystemPrompt combines the power of the Model Context Protocol (MCP) with advanced voice processing to create a uniquely powerful, voice-first AI assistant experience. This page explains how these technologies work together and how to get the most out of voice interactions.

Voice-First Design

SystemPrompt has been designed from the ground up as a voice-first AI assistant:

  • Natural Conversation: Speak naturally rather than using rigid command structures
  • Contextual Understanding: Maintains conversation context across interactions
  • Interruption Handling: Handles mid-sentence corrections and interruptions
  • Text Fallback: Seamlessly switch between voice and text as needed

MCP's Role in Voice Interactions

The Model Context Protocol enhances voice interactions in several key ways:

1. Tool Access via Voice

MCP allows SystemPrompt to access tools through voice commands:

  • "Search Reddit for top posts about AI voice assistants"
  • "Summarize my unread emails from today"
  • "Create a new Notion page for my project ideas"
  • "Move all PDF files from Downloads to Documents folder"

Each of these commands triggers the appropriate MCP tool, with your permission.

2. Resource Navigation

Voice commands can efficiently navigate resources exposed through MCP:

  • "Show me the most recent post on r/technology"
  • "Open the email from John about the project deadline"
  • "Find files in my Documents folder containing budget information"
  • "Go to the second page of search results"

3. Context Persistence

MCP's context management combines with voice interaction history:

  • Previous commands and responses remain in context
  • References to earlier statements are understood
  • Long-running conversations maintain coherence
  • Cross-device continuity for ongoing discussions

Voice Command Patterns

SystemPrompt understands various voice command patterns:

Direct Questions

"What's the most upvoted post on Reddit today?"
"How many unread emails do I have?"
"What files are in my Downloads folder?"

Action Requests

"Search Reddit for top posts about AI voice assistants"
"Create a summary of my emails from today"
"Move the PDF files from Downloads to Documents"

Follow-Up Commands

"Show me the comments on that post"
"Summarize the first three emails"
"Open the second file you mentioned"

Multi-Step Workflows

"Find posts about AI on Reddit, summarize the top three, and create a new document with the summary"
"Check my emails from yesterday, identify any that need responses, and draft replies"
"Search my files for budget information, extract the key figures, and create a chart"

Voice Best Practices

To get the best results with SystemPrompt's voice interface:

  1. Be Natural: Speak in a natural, conversational manner
  2. Provide Context: Include relevant details in your requests
  3. Use Names: Reference services by name ("on Reddit," "in Gmail")
  4. Be Patient: Allow a moment for processing complex commands
  5. Use Interruptions Wisely: Interrupt when needed, but avoid frequent interruptions
  6. Combine with Text: Use text for complex inputs when appropriate

On this page