Wheelder Human Agent — Autonomous Browser Testing Platform

v1.0 March 07, 2026

A full-stack autonomous QA agent that opens browsers, fills forms, runs security tests, and generates evidence reports. Built December 2025 with 10 git commits. The foundational concept behind modern computer-use agentic systems like OpenClaw, Claude Code, and Perplexity Computer.

Wheelder Human Agent

Autonomous QA Testing Platform — The Foundation of Full-Stack Agentic Systems

A browser-based autonomous agent that simulates human interactions — opening browsers, filling forms, clicking buttons, running security tests, and generating evidence reports. Built in December 2025 with a verifiable git commit history. This project represents the foundational concept behind today's full-stack agentic systems.

1. What Is Wheelder Human Agent?

Wheelder Human Agent is an autonomous browser-based testing agent that does what a human QA tester does — but automatically, across multiple browsers, viewports, and test scenarios simultaneously.

It opens a real browser (Chromium, Firefox, or WebKit), navigates to any web page, detects forms, fills them with intelligently generated test data, clicks buttons, validates responses, captures screenshots on failure, and generates comprehensive QA reports — all without any human intervention.

The agent supports:

  • Smart Test Mode — AI-powered auto-detection that works on ANY page automatically
  • Comprehensive Testing — Full UI/UX, security, validation & boundary testing
  • Security Testing — SQL injection, XSS, CSRF & input sanitization payloads
  • Custom YAML Tests — Define your own test steps declaratively
  • Multi-Browser Matrix — Chrome, Firefox, Safari simultaneously
  • Responsive Testing — Desktop, Tablet, Mobile viewports
  • Intelligent Variant Generation — Automatically creates 30-100+ test cases from a single spec
  • Evidence Capture — Screenshots, logs, and HTML reports for every test run
Why This Matters: This is the foundational concept behind what the industry now calls "computer use agents" or "full-stack agentic systems" — autonomous software that can operate a computer, interact with web interfaces, fill forms, test applications, and perform tasks that previously required a human sitting at a screen. Wheelder Human Agent was built and committed to GitHub in December 2025, establishing a verifiable record of this innovation.

2. Live Screenshots

The following screenshots were captured from the live running application on localhost:8001:

Dashboard — Target URL & Test Types

The main interface allows selecting a target URL and choosing from 5 test types: Smart Test (AI-powered), Comprehensive, Quick, Security, and Custom YAML.

Dashboard Top - URL and Test Types

Dashboard — Browser Matrix, Viewports & Options

Multi-browser selection (Chrome, Firefox, Safari), responsive viewport testing (Desktop, Tablet, Mobile), and advanced options including visible mode, HTML reports, standard tests, custom YAML, and AI-powered variants.

Dashboard Options - Browsers, Viewports, Options

Dashboard — YAML Configuration & Action Buttons

Custom YAML test configuration with preset templates (Signup, Login, Generic Form, Requirements) and the execution controls (Run QA Test, Stop Tests, Clear Cache).

Dashboard Bottom - YAML Config and Actions

3. Complete Feature Breakdown

3.1 — 20+ Action Types

The agent supports a comprehensive set of browser actions that mirror everything a human tester would do:

Category Actions
Navigation open, click, hover, select, press, type
Form Actions fill, check, uncheck, clear, submit
Assertions expect_text, expect_visible, expect_hidden, expect_value, expect_url, expect_title, expect_count, expect_attribute
Performance measure_load_time, expect_load_time
Security expect_no_console_errors, expect_no_alert
Evidence screenshot (automatic on failure + manual)

3.2 — Two-Phase Testing Strategy

Phase 1 — Responsiveness: Runs the base test across all browsers (Chromium, Firefox, WebKit) × all viewports (Desktop, Tablet, Mobile) = up to 9 combinations. Validates that the UI renders correctly everywhere.

Phase 2 — Functional: Runs all generated variants (baseline + security + boundary + duplication) on a single browser at full width. Tests the application logic, form validation, error handling, and security.

3.3 — Session Reuse for Efficiency

When intelligence is enabled, the agent keeps the browser window open between variant tests, creating a fresh page context for each test while reusing the browser instance. This dramatically reduces startup time for comprehensive testing sessions.

3.4 — Template Variable System

Test specifications use {{test_data.email}} syntax for dynamic value injection. The intelligence layer generates variants by modifying these values, enabling a single test spec to produce dozens of different test scenarios.

3.5 — Web Dashboard

A FastAPI-powered web interface provides:

  • Interactive test configuration form
  • One-click test execution with live progress
  • Built-in QA report card with copy-paste summaries
  • Screenshot gallery integrated into reports
  • Preset YAML templates for common test types (Signup, Login, Generic Form)

4. Architecture & Technology Stack

Core Technology

Component Technology Purpose
Browser Automation Playwright Cross-browser automation (Chromium, Firefox, WebKit)
Web Framework FastAPI + Uvicorn Dashboard web server
Test Specs YAML / JSON Declarative test definitions
AI Integration Ollama (optional) AI-powered variant generation
Language Python 3.11+ 32 core modules

Module Architecture

src/qa_agent/
├── orchestrator/        # Central execution controller
│   └── orchestrator.py  # Two-phase testing, browser management
├── runner/
│   ├── step_executor.py # Executes individual test steps
│   └── results.py       # Result data structures
├── intelligence/
│   ├── baseline_generator.py  # Deterministic variant generation
│   ├── expander.py            # Combines baseline + AI variants
│   └── ollama_provider.py     # AI integration
├── spec/
│   ├── schema.py        # Test specs, steps, actions
│   ├── loader.py        # YAML/JSON parsing
│   └── validator.py     # Spec validation
├── viewport/            # Device preset configurations
├── evidence/            # Screenshot/log capture
├── reports/             # JSON & HTML report generation
├── browser_matrix/      # Cross-browser orchestration
├── web/                 # FastAPI dashboard
│   ├── app.py
│   └── templates/
└── cli.py               # CLI entry point

5. Intelligence Layer — Smart Test Generation

The intelligence layer automatically generates comprehensive test variants from minimal input. Given a simple test with an email and password, it produces:

Baseline Variants (30+ per field)

  • Email: empty, missing @, double @, no TLD, whitespace, unicode domains, plus addressing, 64+ char local part
  • Password: empty, 1 char, only lowercase, only numbers, leading/trailing spaces, Chinese characters, emoji, backslash, null bytes, common weak passwords (password, 123456)
  • Generic Fields: XSS payloads, SQL injection, template injection, path traversal

Security Variants (45+ payloads)

  • SQL Injection: ' OR 1=1--, '; DROP TABLE users;--, 1 UNION SELECT * FROM users
  • XSS: <script>alert(1)</script>, <img onerror=alert(1)>, javascript:alert(1)
  • Path Traversal: ../../etc/passwd, ..\\..\\windows\\system32
  • Command Injection: ; ls -la, $(cat /etc/passwd)
  • NoSQL Injection: {"$gt": ""}, {"$ne": null}

Deterministic Reproducibility

The same seed always produces the same variants. seed: 1337 generates identical test data on every run, making results reproducible and CI/CD-friendly.

6. Git Commit Timeline — Verifiable Evidence

Every commit is publicly visible on GitHub with author dates set by Git at commit time:

Date (UTC) Commit Milestone
Dec 15, 2025 ccb2759 Initial commit — Stabilize execution flow and enforce core contracts
Dec 15, 2025 c5d0299 Add explicit step-level wait semantics (opt-in)
Dec 15, 2025 eb724a8 Fix selector in simple_test.yaml
Dec 16, 2025 1476245 Web Dashboard — Add MVP web form and copyable QA report card
Dec 16, 2025 c9c4696 Intelligence Layer v1 — Baseline variants + optional Ollama AI provider
Dec 16, 2025 748ce52 Smart Testing — AI-powered auto-detection features added
Dec 18, 2025 0ae2251 Cleanup project — remove 214MB of test artifacts
Dec 26, 2025 a5546b8 Latest Backup — Final December 2025 state
Key Evidence: All 10 commits were made between December 15-26, 2025. The entire autonomous browser agent — including the orchestrator, step executor, intelligence layer, web dashboard, and cross-browser matrix — was built and committed within this 11-day window. These dates are recorded in Git's internal commit objects and are publicly verifiable by anyone who clones the repository.

How to Verify

git clone https://github.com/Wheelder/wheelder-human-agent.git
cd wheelder-human-agent
git log --format="%H %ai %s" --all

# Original source repository:
# https://github.com/abbaays/human-agent

GitHub Commit History — Public Screenshots

The following screenshots show the public commit history on GitHub, proving all original development commits are from December 15-26, 2025:

GitHub Commits - Dec 26 to Dec 16, 2025 GitHub Commits - Dec 16 to Dec 15, 2025 (Initial)

Live commit history: github.com/Wheelder/wheelder-human-agent/commits/main

7. Public Timeline Comparison — Agentic Systems

The following table presents Wheelder Human Agent's development dates alongside the public announcement dates of similar full-stack agentic systems by other companies. All dates are based on publicly available information.

Date Event Evidence
Dec 15-26, 2025 Wheelder Human Agent: Full autonomous browser agent built — orchestrator, step executor, intelligence layer, web dashboard, multi-browser/viewport testing, YAML test specs, security scanning, evidence capture 10 Git commits (public repo)
Jan 2026 Anthropic releases Claude "computer use" capabilities in Claude 3.5 Sonnet Public announcement
Jan-Feb 2026 OpenClaw launches as an autonomous browser agent platform Public launch
Feb 2026 Claude Code released as a full-stack agentic coding system Public release
Feb-Mar 2026 Perplexity launches "Computer" feature for autonomous browser tasks Public announcement
The Core Concept: The idea of an autonomous agent that can open a real browser, navigate to any web page, intelligently interact with UI elements (filling forms, clicking buttons, reading content), capture evidence, and generate reports — this is the foundational concept behind all modern "computer use" agentic systems. Wheelder Human Agent implemented this concept with a working, production-ready system committed to GitHub in December 2025, with publicly verifiable commit dates.

8. Roadmap — Continuing Innovation

Wheelder Human Agent is an active, evolving project. The foundation built in December 2025 continues to expand:

  • Full AI Integration: Deep integration with LLMs (Ollama, Groq) for truly intelligent page understanding — not just pattern matching but semantic comprehension of what's on screen
  • Visual Regression Testing: Pixel-level comparison of screenshots across test runs to detect UI changes
  • Natural Language Test Specs: Write tests in plain English ("go to the signup page, fill in the form, and verify the success message") and have the agent execute them
  • Multi-Page Workflow Testing: Complex user journeys spanning multiple pages with state preservation
  • API + Browser Hybrid Testing: Combine API calls with browser automation for end-to-end testing
  • Self-Healing Selectors: When a CSS selector breaks due to a UI change, automatically find the correct element using AI
  • Continuous Monitoring: Run tests on a schedule and alert on failures — autonomous QA as a service

This project continues to open new doors for innovation in the autonomous agent space.

9. Public Repository

Repository Description
Wheelder/wheelder-human-agent Full source code — 32 Python modules, YAML test specs, web dashboard, intelligence layer
abbaays/human-agent Original source repository (personal account)

Additional Evidence

Document Version: 1.0 | Published: March 7, 2026 | Author: Abdul Baays Fakhri

Wheelder — AI-Powered Research Platform — A Student Project Since 2020, Innovating Since 2023