Vision Mode
Vision is the Bankai mode that can see your screen. Hold the Vision shortcut, ask about whatever is in front of you — a chart, an error message, a spreadsheet, a paragraph in another app — and Bankai takes a screenshot, looks at it, and answers. The answer lands right at your cursor, or is read out loud if you ask for it.
It is the difference between describing what you are looking at and simply asking about it. Instead of "there's a number in the third column of a table about Q3 revenue, what's the growth rate," you just say "what's the growth rate here?" while looking at the table. Bankai sees what you see.
Vision Mode runs in the Copera desktop app (Windows and macOS), because it needs to capture your screen — something a web browser cannot do. On macOS it also needs Screen Recording permission (see below).
Turning On Vision Mode
Vision is off until you enable it:
- Open the AI section and go to Voice Settings.
- In the Modes section, turn on Enable Vision Mode.
Once enabled, Vision joins your mode rotation (its color is magenta and its icon is an eye) and its shortcut becomes active.
macOS: Screen Recording Permission
On macOS, capturing the screen requires the system's Screen Recording permission. The first time you enable Vision, Bankai walks you through granting it:
- Bankai opens a permission step and asks you to allow Screen Recording in System Settings.
- After you grant it, macOS requires Copera to restart for the change to take effect — Bankai shows a Restart Copera button to do this in one click.
- Once Copera restarts, Vision is ready to use.
Vision needs this permission only on macOS. On Windows, Vision works as soon as you enable the mode — there is no extra system permission to grant.
Asking About Your Screen
Using Vision is the same hold-speak-release rhythm as the rest of Bankai:
- Hold the Vision shortcut. Bankai starts listening immediately and grabs a screenshot of your current screen at the same time.
- Ask your question about what is on screen — "summarize this," "what does this error mean," "what's the total in this column?"
- Release the key. Bankai reads the screenshot alongside your question and answers.
Because the screenshot is captured the instant you start talking, Vision feels just as fast as the other modes — there is no separate "take a screenshot first" step.
Where the Answer Appears
By default, the Vision answer is pasted at your cursor, just like Ask mode — ready to drop into a document, chat message, or wherever you are working.
If you would rather hear the answer, just ask for it out loud — say something like "tell me out loud" or "read it to me" as part of your question, and Bankai speaks the answer through your speakers instead of pasting it.
Examples
| What you ask (while looking at…) | What Vision does |
|---|---|
| A revenue chart — "what's the trend here?" | Reads the chart and describes the trend |
| An error message — "what does this mean and how do I fix it?" | Explains the error and suggests a fix |
| A spreadsheet — "what's the total of this column?" | Reads the numbers on screen and answers |
| A long article — "summarize this in three bullets" | Summarizes what is visible on screen |
| A form in another language — "translate this to English" | Reads and translates the on-screen text |
Vision in Omni Mode
You do not always have to switch to Vision deliberately. If you use Omni mode (where Bankai figures out your intent automatically) and Vision is enabled, Omni will reach for your screen on its own when your question sounds like a screen question — for example, "what's on my screen right now?" or "look at this spreadsheet and tell me the total."
When that happens, Omni captures a screenshot, answers using it, and the result is saved as a Vision session in your history. If your Omni question is not about the screen, no screenshot is taken — so everyday Omni use stays fast.
Keep Vision enabled and stay in Omni mode for the most natural experience: just talk, and Bankai pulls in your screen only when the question actually calls for it.
Your Screenshots Stay on Your Computer
Vision screenshots are stored locally on your device, not uploaded to Copera. They follow the same retention rules you set for your Bankai audio, so you control how long they are kept (see Voice Settings).
In your Bankai History, Vision sessions show the screenshot that was captured alongside your question and the answer. Click a screenshot to open it full-size in a viewer.
Retrying a Vision Answer
If a Vision answer was not quite what you needed, you can retry it from your Bankai History — Bankai re-runs the request using the screenshot it already captured, so you do not have to recreate the moment.
Settings and Configuration
| Setting | What it controls | Default |
|---|---|---|
| Enable Vision Mode | Turns Vision on or off (in the Modes section of Voice Settings). | Off |
| Screen Recording permission (macOS only) | System permission Vision needs to capture your screen. Granted through a guided step on macOS. | Not granted |
| Screenshot retention | How long captured screenshots are kept on your device — follows your audio retention setting. | — |
Tips and Best Practices
Put the thing you want to ask about clearly on screen before you hold the shortcut. Vision captures whatever is visible the moment you start speaking.
For small text — fine print, dense tables, code — Vision captures at high detail so it can read the details, but a larger or zoomed-in view still gives the most accurate answer.
Use "tell me out loud" when your hands are busy or your eyes are on something else — Vision will speak the answer instead of pasting it.
Frequently Asked Questions
Why don't I see Vision Mode?
Vision is off by default — turn on Enable Vision Mode in the Modes section of Voice Settings. It also requires the Copera desktop app; it is not available in a web browser.
Why does Vision not work on my Mac after I enabled it?
macOS needs the Screen Recording permission, and newly-granted Screen Recording only takes effect after Copera restarts. Follow the guided permission step and click Restart Copera when prompted.
Are my screenshots sent to Copera or stored in the cloud?
Your Vision screenshots are stored locally on your own device and follow your Bankai retention settings. They are not kept in your notes or uploaded to Copera storage.
Can Bankai read my screen automatically?
Only when you ask. Vision captures a screenshot when you hold the Vision shortcut, or when you are in Omni mode and your question clearly asks about the screen. It never watches your screen in the background.
Where does the answer go?
By default it is pasted at your cursor. Ask for it "out loud" and Bankai speaks the answer instead.
Related Features
- Voice Modes — All of Bankai's modes and when to use each.
- Keyboard Shortcuts — The Vision shortcut for your platform.
- Voice Settings — Enable Vision, set retention, and tune the answer bubble.
- History & Analytics — Review your Vision sessions and their screenshots.