Do Nothing

AI-powered Chrome extension that can read pages, reason with Gemini, and execute real DOM actions (click, type, extract, scroll, fill forms, and more).

Why This Exists

Most browser assistants stop at suggestions. Do Nothing executes actions directly on the current page through a controlled tool loop.

What It Can Do

Run browser automation from natural language prompts.
Auto-select domain-specific skills (for LinkedIn, jobs, extraction, summaries, etc.).
Stream progress via pipeline steps and tool call states.
Keep session history and long-term memory in local extension storage.
Support voice typing in the sidepanel chat input.

How It Works (High Level)

flowchart LR
    U[User Prompt in Sidepanel] --> BG[Background Service Worker]
    BG --> CTX[Context Builder<br/>+ Memory + Skills]
    CTX --> LLM[Gemini Model]
    LLM -->|Function Calls| DOM[Content Script DOM Engine]
    DOM -->|Action Results| LLM
    LLM -->|Final Text| BG
    BG --> UI[Sidepanel Updates<br/>chat, pipeline, tools]

Read deeper architecture docs in ARCHITECTURE.md.

Built-In Skills

The registry lives in src/background/skills.ts and includes skills such as:

LinkedIn Connect / Message / Follow / Comment
X/Twitter Follow
Job Applier
Page Summarizer
Data Extractor
Form Filler
Auto Scroller
Email Drafter
Shopping Assistant

Skills are matched by URL + intent score and can be explicitly selected from the Skills tab.

Quick Start

1. Install dependencies

npm install

2. Build extension

npm run build

3. Load in Chrome

Open chrome://extensions
Enable Developer mode
Click Load unpacked
Select the dist/ folder

4. Configure API key

Open extension sidepanel
Go to Settings
Add your Gemini API key
Save settings

Usage Guide

Chat panel

Ask with direct intent: Extract all job titles and company names from this page.
Use explicit constraints: Apply only to remote React roles with Easy Apply.
If tasks are complex, split into smaller prompts.

Skills panel

Pick a specific skill when you need deterministic behavior.
Use sample prompts from each skill card as templates.

Memory panel

Store durable facts you want future tasks to reuse.
Keep memory concise and structured for better reuse.

Voice typing

Click the mic icon in chat input.
Allow microphone permission when prompted.
Speak in short phrases for cleaner transcripts.

Use It Better (Practical Tips)

Start with a dry run prompt: Show me what you'd do first, then execute.
Prefer exact selectors/targets in prompts when possible.
For bulk actions, define limits: first 10, top 5, only visible.
For form tasks, provide profile data once in memory, then reuse.
Tune Action Delay in Settings to reduce rate-limit or anti-bot triggers.

Project Structure

src/
  background/    Service worker, Gemini loop, skills, memory, session
  content/       DOM interaction engine injected into pages
  sidepanel/     React UI (chat, skills, memory, history, settings)
dist/            Webpack build output for unpacked extension loading

Development

npm run dev     # watch mode
npm run build   # production build
npm run clean   # remove dist

Security & Privacy Notes

Memory and session data are stored in chrome.storage.local.
API key is stored locally via extension settings.
The extension executes DOM actions only on pages where content scripts run.

Troubleshooting

No response / errors: verify API key and selected model in Settings.
Action fails repeatedly: ask for smaller scoped actions and clearer targets.
Voice typing not starting: check browser microphone permissions for extensions.
Content script issues: reload extension and the active tab.

Contributing

Contributions are welcome, especially new day-to-day automation skills.

Contribution guide: CONTRIBUTING.md
Architecture details: ARCHITECTURE.md

Support & Community

Buy me a coffee: ☕ buymeacoffee.com/bhuvan_ade
Star this repo: ⭐ github.com/BhuvanAde/do-nothing

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
icons		icons
src		src
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
CONTRIBUTING.md		CONTRIBUTING.md
PRIVACY.md		PRIVACY.md
README.md		README.md
manifest.json		manifest.json
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json
webpack.config.js		webpack.config.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Do Nothing

Why This Exists

What It Can Do

How It Works (High Level)

Built-In Skills

Quick Start

1. Install dependencies

2. Build extension

3. Load in Chrome

4. Configure API key

Usage Guide

Chat panel

Skills panel

Memory panel

Voice typing

Use It Better (Practical Tips)

Project Structure

Development

Security & Privacy Notes

Troubleshooting

Contributing

Support & Community

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Do Nothing

Why This Exists

What It Can Do

How It Works (High Level)

Built-In Skills

Quick Start

1. Install dependencies

2. Build extension

3. Load in Chrome

4. Configure API key

Usage Guide

Chat panel

Skills panel

Memory panel

Voice typing

Use It Better (Practical Tips)

Project Structure

Development

Security & Privacy Notes

Troubleshooting

Contributing

Support & Community

About

Resources

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages