Markdown Converter
Agent skill for markdown-converter
Control browser automation through HTTP API. Supports page navigation, element interaction (click, type, select), data extraction, accessibility snapshot analysis, screenshot, JavaScript execution, and batch operations.
Sign in to like and favorite skills
BrowserWing Executor provides comprehensive browser automation capabilities through HTTP APIs. You can control browser navigation, interact with page elements, extract data, and analyze page structure.
API Base URL:
http://localhost:8080/api/v1/executor, host=localhost:8080
IMPORTANT: Always call this endpoint first to see all available commands and their parameters.
curl -X GET 'http://{host}/api/v1/executor/help'
Response: Returns complete list of all commands with parameters, examples, and usage guidelines.
Query specific command:
curl -X GET 'http://{host}/api/v1/executor/help?command=extract'
CRITICAL: Always call this after navigation to understand page structure and get element RefIDs.
curl -X GET 'http://{host}/api/v1/executor/snapshot'
Response Example:
{ "success": true, "snapshot_text": "Clickable Elements:\n @e1 Login (role: button)\n @e2 Sign Up (role: link)\n\nInput Elements:\n @e3 Email (role: textbox) [placeholder: [email protected]]\n @e4 Password (role: textbox)" }
Use Cases:
curl -X POST 'http://{host}/api/v1/executor/navigate' \ -H 'Content-Type: application/json' \ -d '{"url": "https://example.com"}'
curl -X POST 'http://{host}/api/v1/executor/click' \ -H 'Content-Type: application/json' \ -d '{"identifier": "@e1"}'
Identifier formats:
@e1, @e2 (from snapshot)#button-id, .class-name//button[@type='submit']Login (text content)curl -X POST 'http://{host}/api/v1/executor/type' \ -H 'Content-Type: application/json' \ -d '{"identifier": "@e3", "text": "[email protected]"}'
curl -X POST 'http://{host}/api/v1/executor/extract' \ -H 'Content-Type: application/json' \ -d '{ "selector": ".product-item", "fields": ["text", "href"], "multiple": true }'
curl -X POST 'http://{host}/api/v1/executor/wait' \ -H 'Content-Type: application/json' \ -d '{"identifier": ".loading", "state": "hidden", "timeout": 10}'
curl -X POST 'http://{host}/api/v1/executor/batch' \ -H 'Content-Type: application/json' \ -d '{ "operations": [ {"type": "navigate", "params": {"url": "https://example.com"}, "stop_on_error": true}, {"type": "click", "params": {"identifier": "@e1"}, "stop_on_error": true}, {"type": "type", "params": {"identifier": "@e3", "text": "query"}, "stop_on_error": true} ] }'
List all tabs:
curl -X POST 'http://{host}/api/v1/mcp/message' \ -H 'Content-Type: application/json' \ -d '{ "method": "tools/call", "params": { "name": "browser_tabs", "arguments": {"action": "list"} } }'
Create new tab:
curl -X POST 'http://{host}/api/v1/mcp/message' \ -H 'Content-Type: application/json' \ -d '{ "method": "tools/call", "params": { "name": "browser_tabs", "arguments": { "action": "new", "url": "https://example.com" } } }'
Switch to tab (by index):
curl -X POST 'http://{host}/api/v1/mcp/message' \ -H 'Content-Type: application/json' \ -d '{ "method": "tools/call", "params": { "name": "browser_tabs", "arguments": { "action": "switch", "index": 1 } } }'
Close tab (by index):
curl -X POST 'http://{host}/api/v1/mcp/message' \ -H 'Content-Type: application/json' \ -d '{ "method": "tools/call", "params": { "name": "browser_tabs", "arguments": { "action": "close", "index": 2 } } }'
Note: Tab indices are 0-based (first tab is 0, second tab is 1, etc.)
Alternative: Direct HTTP API (without MCP):
# List tabs curl -X POST 'http://{host}/api/v1/executor/tabs' \ -H 'Content-Type: application/json' \ -d '{"action": "list"}' # Create new tab curl -X POST 'http://{host}/api/v1/executor/tabs' \ -H 'Content-Type: application/json' \ -d '{"action": "new", "url": "https://example.com"}' # Switch to tab curl -X POST 'http://{host}/api/v1/executor/tabs' \ -H 'Content-Type: application/json' \ -d '{"action": "switch", "index": 1}' # Close tab curl -X POST 'http://{host}/api/v1/executor/tabs' \ -H 'Content-Type: application/json' \ -d '{"action": "close", "index": 2}'
Fill a login form:
curl -X POST 'http://{host}/api/v1/mcp/message' \ -H 'Content-Type: application/json' \ -d '{ "method": "tools/call", "params": { "name": "browser_fill_form", "arguments": { "fields": [ {"name": "username", "value": "[email protected]"}, {"name": "password", "value": "secret123"}, {"name": "remember", "value": true} ], "submit": true } } }'
Fill a registration form:
curl -X POST 'http://{host}/api/v1/mcp/message' \ -H 'Content-Type: application/json' \ -d '{ "method": "tools/call", "params": { "name": "browser_fill_form", "arguments": { "fields": [ {"name": "email", "value": "[email protected]"}, {"name": "name", "value": "John Doe"}, {"name": "age", "value": 25}, {"name": "country", "value": "United States"}, {"name": "subscribe", "value": true} ], "submit": false } } }'
Supported field types:
Field finding strategies:
Alternative: Direct HTTP API (without MCP):
# Fill a login form curl -X POST 'http://{host}/api/v1/executor/fill-form' \ -H 'Content-Type: application/json' \ -d '{ "fields": [ {"name": "username", "value": "[email protected]"}, {"name": "password", "value": "secret123"}, {"name": "remember", "value": true} ], "submit": true, "timeout": 10 }' # Fill a registration form (without submit) curl -X POST 'http://{host}/api/v1/executor/fill-form' \ -H 'Content-Type: application/json' \ -d '{ "fields": [ {"name": "email", "value": "[email protected]"}, {"name": "name", "value": "John Doe"}, {"name": "age", "value": 25}, {"name": "country", "value": "United States"} ], "submit": false }'
Response Example:
{ "success": true, "message": "Successfully filled 3/3 fields and submitted form", "data": { "filled_count": 3, "total_fields": 3, "errors": [], "submitted": true } }
Step-by-step workflow:
Discover commands: Call
GET /help to see all available operations and their parameters (do this first if unsure).
Navigate: Use
POST /navigate to open the target webpage.
Analyze page: Call
GET /snapshot to understand page structure and get element RefIDs.
Interact: Use element RefIDs (like
@e1, @e2) or CSS selectors to:
POST /clickPOST /typePOST /selectPOST /waitExtract data: Use
POST /extract to get information from the page.
Present results: Format and show extracted data to the user.
User Request: "Search for 'laptop' on example.com and get the first 5 results"
Your Actions:
curl -X POST 'http://{host}/api/v1/executor/navigate' \ -H 'Content-Type: application/json' \ -d '{"url": "https://example.com/search"}'
curl -X GET 'http://{host}/api/v1/executor/snapshot'
Response shows:
@e3 Search (role: textbox) [placeholder: Search...]
curl -X POST 'http://{host}/api/v1/executor/type' \ -H 'Content-Type: application/json' \ -d '{"identifier": "@e3", "text": "laptop"}'
curl -X POST 'http://{host}/api/v1/executor/press-key' \ -H 'Content-Type: application/json' \ -d '{"key": "Enter"}'
curl -X POST 'http://{host}/api/v1/executor/wait' \ -H 'Content-Type: application/json' \ -d '{"identifier": ".search-results", "state": "visible", "timeout": 10}'
curl -X POST 'http://{host}/api/v1/executor/extract' \ -H 'Content-Type: application/json' \ -d '{ "selector": ".result-item", "fields": ["text", "href"], "multiple": true }'
Found 15 results for 'laptop': 1. Gaming Laptop - $1299 (https://...) 2. Business Laptop - $899 (https://...) ...
POST /navigate - Navigate to URLPOST /go-back - Go back in historyPOST /go-forward - Go forward in historyPOST /reload - Reload current pagePOST /click - Click element (supports: RefID @e1, CSS selector, XPath, text content)POST /type - Type text into input (supports: RefID @e3, CSS selector, XPath)POST /select - Select dropdown optionPOST /hover - Hover over elementPOST /wait - Wait for element state (visible, hidden, enabled)POST /press-key - Press keyboard key (Enter, Tab, Ctrl+S, etc.)POST /extract - Extract data from elements (supports multiple elements, custom fields)POST /get-text - Get element text contentPOST /get-value - Get input element valueGET /page-info - Get page URL and titleGET /page-text - Get all page textGET /page-content - Get full HTMLGET /snapshot - Get accessibility snapshot with RefIDs (⭐ ALWAYS call after navigation)GET /clickable-elements - Get all clickable elementsGET /input-elements - Get all input elementsPOST /screenshot - Take page screenshot (base64 encoded)POST /evaluate - Execute JavaScript codePOST /batch - Execute multiple operations in sequencePOST /scroll-to-bottom - Scroll to page bottomPOST /resize - Resize browser windowYou can identify elements using:
RefID (Recommended):
@e1, @e2, @e3
/snapshot endpoint"identifier": "@e1"CSS Selector:
#id, .class, button[type="submit"]
"identifier": "#login-button"Text Content:
Login, Sign Up, Submit
"identifier": "Login"XPath:
//button[@id='login']
"identifier": "//button[@id='login']"ARIA Label: Elements with
aria-label attribute
Before starting:
GET /help if you're unsure about available commands or their parametersDuring automation:
/snapshot after navigation to get page structure and RefIDs@e1) over CSS selectors for reliability and stability/wait for dynamic content that loads asynchronously/batch for multiple sequential operations to improve efficiencyError handling:
/snapshot again to refresh page structure and get new RefIDsData extraction:
fields parameter to specify what to extract: ["text", "href", "src"]multiple: true to extract from multiple elementsScenario: User wants to login to a website
User: "Please log in to example.com with username 'john' and password 'secret123'"
Your Actions:
Step 1: Navigate to login page
POST http://{host}/api/v1/executor/navigate {"url": "https://example.com/login"}
Step 2: Get page structure
GET http://{host}/api/v1/executor/snapshot
Response:
Clickable Elements: @e1 Login (role: button) Input Elements: @e2 Username (role: textbox) @e3 Password (role: textbox)
Step 3: Enter username
POST http://{host}/api/v1/executor/type {"identifier": "@e2", "text": "john"}
Step 4: Enter password
POST http://{host}/api/v1/executor/type {"identifier": "@e3", "text": "secret123"}
Step 5: Click login button
POST http://{host}/api/v1/executor/click {"identifier": "@e1"}
Step 6: Wait for login success (optional)
POST http://{host}/api/v1/executor/wait {"identifier": ".welcome-message", "state": "visible", "timeout": 10}
Step 7: Inform user
"Successfully logged in to example.com!"
Scenario: Fill out a form with multiple fields
Instead of making 5 separate API calls, use one batch operation:
curl -X POST 'http://{host}/api/v1/executor/batch' \ -H 'Content-Type: application/json' \ -d '{ "operations": [ { "type": "navigate", "params": {"url": "https://example.com/form"}, "stop_on_error": true }, { "type": "type", "params": {"identifier": "#name", "text": "John Doe"}, "stop_on_error": true }, { "type": "type", "params": {"identifier": "#email", "text": "[email protected]"}, "stop_on_error": true }, { "type": "select", "params": {"identifier": "#country", "value": "United States"}, "stop_on_error": true }, { "type": "click", "params": {"identifier": "#submit"}, "stop_on_error": true } ] }'
/help or /help?command=<name> to learn about commands/semantic-tree after navigation to understand the page/wait before interacting with elements that load asynchronously/batch for multiple sequential operations/type for each field: @e1, @e2, etc./select for dropdowns/wait/extract with CSS selector and multiple: true["text", "href", "src"]@e2@e3@e1wait_visible: true (default) for reliable element interaction{host} with actual API host addressX-BrowserWing-Key header or JWT tokenElement not found:
/snapshot to see available elements and their RefIDsTimeout errors:
/wait with appropriate state before interactionExtraction returns empty:
/wait first)# Discover commands GET http://{host}/api/v1/executor/help # Navigate POST http://{host}/api/v1/executor/navigate {"url": "..."} # Get page structure GET http://{host}/api/v1/executor/snapshot # Click element POST http://{host}/api/v1/executor/click {"identifier": "@e1"} # Type text POST http://{host}/api/v1/executor/type {"identifier": "@e3", "text": "..."} # Extract data POST http://{host}/api/v1/executor/extract {"selector": "...", "fields": [...], "multiple": true}
All operations return:
{ "success": true, "message": "Operation description", "timestamp": "2026-01-15T10:30:00Z", "data": { // Operation-specific data } }
Error response:
{ "error": "error.operationFailed", "detail": "Detailed error message" }