## Summary Comprehensive fixes for multiple HTMLRewriter bugs including crashes, memory leaks, and improper error handling. ### 🚨 **Primary Issue Fixed** (#21680) - **HTMLRewriter crash when element handlers throw exceptions** - Process would crash with "ASSERTION FAILED: Unexpected exception observed" when JavaScript callbacks in element handlers threw exceptions - **Root cause**: Exceptions weren't properly handled by JavaScriptCore's exception scope mechanism - **Solution**: Used `CatchScope` to properly catch and propagate exceptions through Bun's error handling system ### 🚨 **Additional Bugs Discovered & Fixed** #### 1. **Memory Leaks in Selector Handling** - **Issue**: `selector_slice` string was allocated but never freed when `HTMLSelector.parse()` failed - **Impact**: Memory leak on every invalid CSS selector - **Fix**: Added proper `defer`/`errdefer` cleanup in `on_()` and `onDocument_()` methods #### 2. **Broken Selector Validation** - **Issue**: Invalid CSS selectors were silently succeeding instead of throwing meaningful errors - **Impact**: Silent failures made debugging difficult; invalid selectors like `""`, `"<<<"`, `"div["` were accepted - **Fix**: Changed `return createLOLHTMLError(global)` to `return global.throwValue(createLOLHTMLError(global))` #### 3. **Resource Cleanup on Handler Creation Failures** - **Issue**: Allocated handlers weren't cleaned up if subsequent operations failed - **Impact**: Potential resource leaks in error paths - **Fix**: Added `errdefer` blocks for proper handler cleanup ## Test plan - [x] **Regression test** for original crash case (`test/regression/issue/21680.test.ts`) - [x] **Comprehensive edge case tests** (`test/regression/issue/htmlrewriter-additional-bugs.test.ts`) - [x] **All existing HTMLRewriter tests pass** (41 tests, 146 assertions) - [x] **Memory leak testing** with repeated invalid selector operations - [x] **Security testing** with malicious inputs, XSS attempts, large payloads - [x] **Concurrent usage testing** for thread safety and reuse patterns ### **Before (multiple bugs):** #### Crash: ```bash ASSERTION FAILED: Unexpected exception observed on thread Thread:0xf5a15e0000e0 at: The exception was thrown from thread Thread:0xf5a15e0000e0 at: Error Exception: abc !exception() || m_vm.hasPendingTerminationException() AddressSanitizer: CHECK failed: asan_poisoning.cpp:37 error: script "bd" was terminated by signal SIGABRT (Abort) ``` #### Silent Selector Failures: ```javascript // These should throw but silently succeeded: new HTMLRewriter().on("", handler); // empty selector new HTMLRewriter().on("<<<", handler); // invalid CSS new HTMLRewriter().on("div[", handler); // incomplete attribute ``` ### **After (all issues fixed):** #### Proper Exception Handling: ```javascript try { new HTMLRewriter().on("script", { element(a) { throw new Error("abc"); } }).transform(new Response("<script></script>")); } catch (e) { console.log("GOOD: Caught exception:", e.message); // "abc" } ``` #### Proper Selector Validation: ```javascript // Now properly throws with descriptive errors: new HTMLRewriter().on("", handler); // Throws: "The selector is empty" new HTMLRewriter().on("<<<", handler); // Throws: "The selector is empty" new HTMLRewriter().on("div[", handler); // Throws: "Unexpected end of selector" ``` ## Technical Details ### Exception Handling Fix - Used `CatchScope` to properly catch JavaScript exceptions from callbacks - Captured exceptions in VM's `unhandled_pending_rejection_to_capture` mechanism - Cleared exceptions from scope to prevent assertion failures - Returned failure status to LOLHTML to trigger proper error propagation ### Memory Management Fixes - Added `defer bun.default_allocator.free(selector_slice)` for automatic cleanup - Added `errdefer` blocks for handler cleanup on failures - Ensured all error paths properly release allocated resources ### Error Handling Improvements - Fixed functions returning `bun.JSError!JSValue` to properly throw errors - Distinguished between functions that return errors vs. throw them - Preserved original exception messages through the error chain ## Impact ✅ **No more process crashes** when HTMLRewriter handlers throw exceptions ✅ **No memory leaks** from failed selector parsing operations ✅ **Proper error messages** for invalid CSS selectors with specific failure reasons ✅ **Improved reliability** across all edge cases and malicious inputs ✅ **Maintains 100% backward compatibility** - all existing functionality preserved This makes HTMLRewriter significantly more robust and developer-friendly while maintaining high performance. Fixes #21680 🤖 Generated with [Claude Code](https://claude.ai/code) --------- Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
Bun
Read the docs →
What is Bun?
Bun is an all-in-one toolkit for JavaScript and TypeScript apps. It ships as a single executable called bun.
At its core is the Bun runtime, a fast JavaScript runtime designed as a drop-in replacement for Node.js. It's written in Zig and powered by JavaScriptCore under the hood, dramatically reducing startup times and memory usage.
bun run index.tsx # TS and JSX supported out-of-the-box
The bun command-line tool also implements a test runner, script runner, and Node.js-compatible package manager. Instead of 1,000 node_modules for development, you only need bun. Bun's built-in tools are significantly faster than existing options and usable in existing Node.js projects with little to no changes.
bun test # run tests
bun run start # run the `start` script in `package.json`
bun install <pkg> # install a package
bunx cowsay 'Hello, world!' # execute a package
Install
Bun supports Linux (x64 & arm64), macOS (x64 & Apple Silicon) and Windows (x64).
Linux users — Kernel version 5.6 or higher is strongly recommended, but the minimum is 5.1.
x64 users — if you see "illegal instruction" or similar errors, check our CPU requirements
# with install script (recommended)
curl -fsSL https://bun.com/install | bash
# on windows
powershell -c "irm bun.com/install.ps1 | iex"
# with npm
npm install -g bun
# with Homebrew
brew tap oven-sh/bun
brew install bun
# with Docker
docker pull oven/bun
docker run --rm --init --ulimit memlock=-1:-1 oven/bun
Upgrade
To upgrade to the latest version of Bun, run:
bun upgrade
Bun automatically releases a canary build on every commit to main. To upgrade to the latest canary build, run:
bun upgrade --canary
Quick links
-
Intro
-
Templating
-
CLI
-
Runtime
-
Package manager
-
Bundler
-
Test runner
-
Package runner
-
API
- HTTP server (
Bun.serve) - WebSockets
- Workers
- Binary data
- Streams
- File I/O (
Bun.file) - import.meta
- SQLite (
bun:sqlite) - PostgreSQL (
Bun.sql) - Redis (
Bun.redis) - S3 Client (
Bun.s3) - FileSystemRouter
- TCP sockets
- UDP sockets
- Globals
- $ Shell
- Child processes (spawn)
- Transpiler (
Bun.Transpiler) - Hashing
- Colors (
Bun.color) - Console
- FFI (
bun:ffi) - C Compiler (
bun:fficc) - HTMLRewriter
- Testing (
bun:test) - Cookies (
Bun.Cookie) - Utils
- Node-API
- Glob (
Bun.Glob) - Semver (
Bun.semver) - DNS
- fetch API extensions
- HTTP server (
Guides
-
Binary
- Convert a Blob to a string
- Convert a Buffer to a blob
- Convert a Blob to a DataView
- Convert a Buffer to a string
- Convert a Blob to a ReadableStream
- Convert a Blob to a Uint8Array
- Convert a DataView to a string
- Convert a Uint8Array to a Blob
- Convert a Blob to an ArrayBuffer
- Convert an ArrayBuffer to a Blob
- Convert a Buffer to a Uint8Array
- Convert a Uint8Array to a Buffer
- Convert a Uint8Array to a string
- Convert a Buffer to an ArrayBuffer
- Convert an ArrayBuffer to a Buffer
- Convert an ArrayBuffer to a string
- Convert a Uint8Array to a DataView
- Convert a Buffer to a ReadableStream
- Convert a Uint8Array to an ArrayBuffer
- Convert an ArrayBuffer to a Uint8Array
- Convert an ArrayBuffer to an array of numbers
- Convert a Uint8Array to a ReadableStream
-
Ecosystem
- Use React and JSX
- Use EdgeDB with Bun
- Use Prisma with Bun
- Add Sentry to a Bun app
- Create a Discord bot
- Run Bun as a daemon with PM2
- Use Drizzle ORM with Bun
- Build an app with Nuxt and Bun
- Build an app with Qwik and Bun
- Build an app with Astro and Bun
- Build an app with Remix and Bun
- Build a frontend using Vite and Bun
- Build an app with Next.js and Bun
- Run Bun as a daemon with systemd
- Deploy a Bun application on Render
- Build an HTTP server using Hono and Bun
- Build an app with SvelteKit and Bun
- Build an app with SolidStart and Bun
- Build an HTTP server using Elysia and Bun
- Build an HTTP server using StricJS and Bun
- Containerize a Bun application with Docker
- Build an HTTP server using Express and Bun
- Use Neon Postgres through Drizzle ORM
- Server-side render (SSR) a React component
- Read and write data to MongoDB using Mongoose and Bun
- Use Neon's Serverless Postgres with Bun
-
HTMLRewriter
-
HTTP
- Hot reload an HTTP server
- Common HTTP server usage
- Write a simple HTTP server
- Configure TLS on an HTTP server
- Send an HTTP request using fetch
- Proxy HTTP requests using fetch()
- Start a cluster of HTTP servers
- Stream a file as an HTTP Response
- fetch with unix domain sockets in Bun
- Upload files via HTTP using FormData
- Streaming HTTP Server with Async Iterators
- Streaming HTTP Server with Node.js Streams
-
Install
- Add a dependency
- Add a Git dependency
- Add a peer dependency
- Add a trusted dependency
- Add a development dependency
- Add a tarball dependency
- Add an optional dependency
- Generate a yarn-compatible lockfile
- Configuring a monorepo using workspaces
- Install a package under a different name
- Install dependencies with Bun in GitHub Actions
- Using bun install with Artifactory
- Configure git to diff Bun's lockb lockfile
- Override the default npm registry for bun install
- Using bun install with an Azure Artifacts npm registry
- Migrate from npm install to bun install
- Configure a private registry for an organization scope with bun install
-
Process
-
Read file
-
Runtime
- Delete files
- Run a Shell Command
- Import a JSON file
- Import a TOML file
- Set a time zone in Bun
- Set environment variables
- Re-map import paths
- Delete directories
- Read environment variables
- Import a HTML file as text
- Install and run Bun in GitHub Actions
- Debugging Bun with the web debugger
- Install TypeScript declarations for Bun
- Debugging Bun with the VS Code extension
- Inspect memory usage using V8 heap snapshots
- Define and replace static globals & constants
- Codesign a single-file JavaScript executable on macOS
-
Streams
- Convert a ReadableStream to JSON
- Convert a ReadableStream to a Blob
- Convert a ReadableStream to a Buffer
- Convert a ReadableStream to a string
- Convert a ReadableStream to a Uint8Array
- Convert a ReadableStream to an array of chunks
- Convert a Node.js Readable to JSON
- Convert a ReadableStream to an ArrayBuffer
- Convert a Node.js Readable to a Blob
- Convert a Node.js Readable to a string
- Convert a Node.js Readable to an Uint8Array
- Convert a Node.js Readable to an ArrayBuffer
-
Test
- Spy on methods in
bun test - Bail early with the Bun test runner
- Mock functions in
bun test - Run tests in watch mode with Bun
- Use snapshot testing in
bun test - Skip tests with the Bun test runner
- Using Testing Library with Bun
- Update snapshots in
bun test - Run your tests with the Bun test runner
- Set the system time in Bun's test runner
- Set a per-test timeout with the Bun test runner
- Migrate from Jest to Bun's test runner
- Write browser DOM tests with Bun and happy-dom
- Mark a test as a "todo" with the Bun test runner
- Re-run tests multiple times with the Bun test runner
- Generate code coverage reports with the Bun test runner
- import, require, and test Svelte components with bun test
- Set a code coverage threshold with the Bun test runner
- Spy on methods in
-
Util
- Generate a UUID
- Hash a password
- Escape an HTML string
- Get the current Bun version
- Encode and decode base64 strings
- Compress and decompress data with gzip
- Sleep for a fixed number of milliseconds
- Detect when code is executed with Bun
- Check if two objects are deeply equal
- Compress and decompress data with DEFLATE
- Get the absolute path to the current entrypoint
- Get the directory of the current file
- Check if the current file is the entrypoint
- Get the file name of the current file
- Convert a file URL to an absolute path
- Convert an absolute path to a file URL
- Get the absolute path of the current file
- Get the path to an executable bin file
-
WebSocket
-
Write file
Contributing
Refer to the Project > Contributing guide to start contributing to Bun.
License
Refer to the Project > License page for information about Bun's licensing.