Chrome Web Store live Apache 2.0

A reading companion that points. ☞

Onhand is a contextual AI for learning and research. Ask about the page you're reading. It highlights the line, leaves a margin note, and explains it in the side panel. No second window, no copy-paste.

☞ Add to Chrome Read the source ↗

Realtime voice tutor 1 min 36 sec demo

nlp.seas.harvard.edu/annotated-transformer/

The Annotated Transformer

Attention

Attention maps a query and key-value pairs to an output. The model compares the query with each key, then uses the resulting weights to average the values.

We call our particular attention Scaled Dot-Product Attention. Queries and keys have dimension d_k; values have dimension d_v.

MatMulScaleMaskSoftMaxMatMul QKV

In implementation, the attention scores are produced by multiplying the query matrix by the transposed key matrix, scaling by sqrt(d_k), and normalizing with softmax.

scores = torch.matmul(query, key.transpose(-2, -1))
scores = scores / math.sqrt(query.size(-1))
p_attn = scores.softmax(dim=-1)
return torch.matmul(p_attn, value)

Softmax turns the scaled scores into attention weights, so the final output is a weighted mixture of the value vectors.

On this page · 2 highlights, 1 note

10:55 PM · Onhand · Page-grounded

Explain scaled dot-product attention and why the scaling is needed; point me to the key code and equation.

Reading the page

Scaled dot-product attention compares every query with every key, divides the scores by sqrt(d_k), applies softmax, then uses those weights to mix the values.

Attention(Q,K,V) = softmax(QK^T / √d_k)V

Queries ask what each token is looking for.
Keys describe what each token can match.
The dot product gives the compatibility score.
Scaling keeps large dimensions from making softmax too sharp.
softmax converts scores into weights.

☞ Watch Onhand work the page — then hover or click a citation to jump to its evidence.

The manicule, a small pointing hand in the margin, has meant "look here, this part matters" in books since the twelfth century. Onhand brings the gesture back, with the model holding the pen.

The pointing hand · a brief defense of marginalia

☞What it does

Four things, done in the margin.

Onhand stays out of the way until you ask. When you do, it does the four things a good study partner would do.

☞

Points to the answer

Highlights the exact phrase or paragraph that answers your question, in gold, and scrolls it into view.

✎

Leaves a note in the margin

Drops a sentence or two next to your highlight, written in the article's own voice, so you can keep reading without losing your place.

⎘

Replay artifacts

Every session is saved with its highlights, notes, transcript, and a snapshot of the page. Come back tomorrow to the exact state you left.

◔

Voice tutor

Press Voice in the side panel and talk through a paper. The realtime model can see your highlights and the visible text. Experimental.

☞How it works

Three steps. No second window.

01 · OPEN

The page you're already reading.

Wikipedia, an arXiv paper, a Google Doc, a Substack. Anything Chromium can render. Onhand attaches to the active tab.

tab — active

arxiv.org/abs/1706.03762
Attention is all you need · pdf

02 · ASK

A question in the side panel.

Plain language. Onhand reads whatever it needs to ground the answer: the page, the selection, the visible headings.

side panel

Explain the scaled dot-product attention
and why we divide by √d_k.

03 · POINT

☞Onhand points at the answer.

Highlights the line, drops a margin note, answers in the side panel with citations back to the page. Then saves it all.

on the page

We call our particular attention "Scaled Dot-Product Attention"…
● ONHAND margin note added

☞Install

Onhand is on the Chrome Web Store.

Install from the Chrome Web Store in one click. It updates automatically.

Chrome Web Store · live now

Add Onhand to Chrome.

Prefer a manual install? Download the ZIP below, unzip it, and load it from chrome://extensions with developer mode on.

☞ Add to Chrome Download latest ZIP Release notes ↗

Build from source instead

☞ terminal · build the extension

# Clone and build
$ git clone https://github.com/Phineas1500/Onhand
$ cd Onhand
$ npm ci
$ npm run build:extension

# Verify the build
$ npm run smoke:browser-runtime
✓ browser runtime ready · 

# For a real provider call
$ OPENAI_API_KEY=… npm run smoke:browser-runtime -- --real-openai

Open the Chrome Web Store listing and click Add to Chrome.
Open Onhand's options page and pick how to connect: Onhand Free, OpenAI Codex, or a provider API key.
Pin the extension, open any page, and click the icon to open the side panel.
Highlight a passage, ask a question, and let Onhand point ☞.

☞

Apache 2.0. Onhand is open source. Sessions stay on your machine in chrome.storage.local, and provider keys never leave your device. Onhand Free routes model requests through the hosted Onhand Worker with anonymous diagnostics for reliability and abuse monitoring.

☞The runtime

36 small, auditable tools that act across your pages.

Each tool is a single, auditable operation against a specific browser tab. Onhand starts with the active page and can target clearly related open sources; you read the script in the side panel after every turn.

☞ Read & search

browser_extract_content
browser_get_visible_text
browser_get_selection
browser_get_viewport_headings
browser_get_scroll_state
browser_get_visible_region_image
browser_get_dom
browser_textbook_search

⌕ PDF research

browser_search_linked_pdf_corpus
browser_pdf_search
browser_pdf_read_pages
browser_pdf_jump_to_page
browser_pdf_capture_page_image
browser_pdf_find_citation
browser_open_pdf_in_onhand_viewer

✎ Annotate & navigate

browser_highlight_text
browser_show_note
browser_scroll_to_annotation
browser_clear_annotations
browser_list_tabs
browser_activate_tab
browser_navigate

◔ Act & interact

browser_click
browser_type
browser_click_text
browser_type_by_label
browser_find_elements
browser_wait_for_selector
browser_pick_elements

⎘ Debug & capture

browser_capture_screenshot
browser_capture_state
browser_list_artifacts
browser_restore_state
browser_collect_console
browser_collect_network
browser_run_js (optional, constrained)

☞Read further

The thinking behind it.

★ Foundations

Constitution

What Onhand is, what it isn't, and the principles that decide every UX call. Read this first.

Open doc ↗

▸ Roadmap

Product & implementation plan

What's shipped, what's next, and the sequencing for PDFs, search, and richer session restore.

Open doc ↗

◔ Experimental

Voice architecture

How the gpt-realtime-2.1 WebRTC voice tutor works against the same page-grounded toolset.

Open doc ↗