Feature Request
AI Desktop Assistant for Windows: Automate OS and App Tasks Beyond Browser
Problem Statement
Current AI-powered browsers like Comet are limited to web-based automation and information retrieval tasks. There is no unified tool for automating operations across both the browser and the Windows desktop, such as file management, launching programs, scheduling, and handling system notifications. As a result, users have to switch between multiple apps or fall back on manual steps, which costs productivity and convenience.
Proposed Solution
Develop a Windows desktop app powered by a conversational AI (like Jarvis) that:
- Automates browser interactions AND system tasks (file operations, app launching, search, scheduling, emails, notifications)
- Uses a natural language interface for commands and queries
- Integrates with desktop APIs (Windows Shell, Cortana, etc.) for automation
- Learns user patterns over time for proactive assistance
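One way to picture how the assistant could connect a natural language command to system tasks is a small dispatcher: the conversational model turns the command into a structured plan, and the desktop app maps each step to a handler. The sketch below is only an illustration under assumptions. The plan format, the action names (`open_app`, `set_reminder`), and the helpers `handler` and `run_plan` are all hypothetical, and the handlers only describe what they would do instead of touching the system.

```python
from typing import Any, Callable, Dict, List

# A handler takes the step's arguments and returns a short status message.
Handler = Callable[[Dict[str, Any]], str]

# Hypothetical registry mapping action names to handlers.
HANDLERS: Dict[str, Handler] = {}


def handler(name: str) -> Callable[[Handler], Handler]:
    """Decorator that registers a function under an action name."""
    def register(fn: Handler) -> Handler:
        HANDLERS[name] = fn
        return fn
    return register


@handler("open_app")
def open_app(args: Dict[str, Any]) -> str:
    # Dry run only: a real handler would launch the program here.
    return f"would launch {args['name']}"


@handler("set_reminder")
def set_reminder(args: Dict[str, Any]) -> str:
    # Dry run only: a real handler would call a calendar or notification API.
    return f"would remind '{args['text']}' at {args['when']}"


def run_plan(plan: List[Dict[str, Any]]) -> List[str]:
    """Run each step in order; unknown actions are reported, not executed."""
    results: List[str] = []
    for step in plan:
        fn = HANDLERS.get(step["action"])
        if fn is None:
            results.append(f"unsupported action: {step['action']}")
            continue
        results.append(fn(step.get("args", {})))
    return results


if __name__ == "__main__":
    # Assumed shape of a plan the language model might produce.
    plan = [
        {"action": "open_app", "args": {"name": "Notepad"}},
        {"action": "set_reminder", "args": {"text": "review PDF", "when": "tomorrow 9:00"}},
    ]
    for line in run_plan(plan):
        print(line)
```

Keeping the handlers behind a registry means new desktop capabilities can be added without changing the conversational layer, and unknown actions are rejected instead of guessed at.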
Example Use Cases:
- “Find and open my last downloaded PDF, email it to my teacher, and set a reminder to review it tomorrow.”
- “Schedule my weekly test times and notify me before each.”
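The first step of the PDF example, finding the last downloaded PDF, can be sketched with the Python standard library alone. This is a minimal sketch under assumptions: the function name `latest_pdf` is made up for illustration, "last downloaded" is approximated by the file's modification time, and the Downloads folder is assumed to be `Path.home() / "Downloads"`, which may not hold on every machine.

```python
from pathlib import Path
from typing import Optional


def latest_pdf(folder: Path) -> Optional[Path]:
    """Return the most recently modified .pdf file in `folder`, or None if there is none."""
    pdfs = [
        p for p in folder.iterdir()
        if p.is_file() and p.suffix.lower() == ".pdf"  # match .pdf and .PDF
    ]
    # max() with default=None avoids an error when the folder holds no PDFs.
    return max(pdfs, key=lambda p: p.stat().st_mtime, default=None)


if __name__ == "__main__":
    downloads = Path.home() / "Downloads"  # assumed location, not guaranteed
    if downloads.is_dir():
        print(latest_pdf(downloads))
```

On Windows, the returned path could then be passed to `os.startfile`, which opens a file with its associated application. Emailing the file and setting the reminder would need the email and calendar integrations discussed under API Impact.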
API Impact
- May require new endpoints for desktop automation tasks (e.g., file operations, OS-level notifications)
- Chat completions, retrieval, and workflow orchestration APIs for natural dialogue
- Integration with calendar, email, and notification APIs, possibly needing additional parameters for system control permissions
- If using a specific research-focused model (e.g., Sonar Deep Research), adapt capabilities for the desktop context
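To make the idea of "additional parameters for system control permissions" more concrete, here is a rough sketch of how a desktop automation request could be checked against the permissions a user has granted. Nothing here reflects an existing API: the request shape `DesktopTaskRequest`, the action names, the scope strings, and the function `check_request` are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Set, Tuple

# Hypothetical mapping from an action name to the permission scope it needs.
REQUIRED_SCOPE: Dict[str, str] = {
    "file.open": "files:read",
    "file.move": "files:write",
    "notification.show": "notifications:send",
    "calendar.create_reminder": "calendar:write",
    "email.send": "email:send",
}


@dataclass
class DesktopTaskRequest:
    """Assumed payload for a desktop automation endpoint."""
    action: str
    params: Dict[str, Any] = field(default_factory=dict)


def check_request(request: DesktopTaskRequest, granted: Set[str]) -> Tuple[bool, str]:
    """Allow the request only if its action is known and its scope was granted."""
    scope = REQUIRED_SCOPE.get(request.action)
    if scope is None:
        return False, f"unknown action: {request.action}"
    if scope not in granted:
        return False, f"missing permission: {scope}"
    return True, "ok"


if __name__ == "__main__":
    granted = {"files:read", "calendar:write"}
    print(check_request(DesktopTaskRequest("file.open", {"path": "report.pdf"}), granted))
    print(check_request(DesktopTaskRequest("email.send", {"to": "teacher"}), granted))
```

Checking scopes before any handler runs keeps system-level actions opt-in, which matters more on the desktop than in a browser sandbox.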
Alternatives Considered
- Existing assistants (Windows Copilot, Cortana, Siri) do not provide seamless cross-app automation using conversational AI.
- Browser-based tools (like Comet) are restricted to web tasks and cannot automate system-level functions.
- Scripting and automation tools (AutoHotkey, Power Automate) require manual setup and lack intelligent conversation capabilities.
Additional Context
This assistant could be invaluable for students, professionals, and power users, streamlining workflow and improving productivity. Integrating with developer tools/APIs can further expand its usefulness for tech-savvy users experimenting with AI.
Contact me at yashwinvisakan@gmail.com. I'm a normal kid interested in learning about AI, and I have done other projects on it.