bun.sh

mirror of https://github.com/oven-sh/bun synced 2026-02-10 19:08:50 +00:00

Author	SHA1	Message	Date
Sosuke Suzuki	1b3500dca2	Update bindings for reduced QueuedTask size - Remove performMicrotaskFunction argument from BunPerformMicrotaskJob - Async context is now handled directly in JSMicrotask.cpp - Update queueMicrotaskJob to use 4 arguments instead of 5	2025-12-29 21:25:06 +09:00
Tommy D. Rossi	538be1399c	feat(bundler): expose reactFastRefresh option in Bun.build API (#25731 ) Fixes #25716 Adds support for a `reactFastRefresh: boolean` option in the `Bun.build` JavaScript API, matching the existing `--react-fast-refresh` CLI flag. ```ts const result = await Bun.build({ reactFastRefresh: true, entrypoints: ["src/App.tsx"], }); ``` When enabled, the bundler adds React Fast Refresh transform code (`$RefreshReg$`, `$RefreshSig$`) to the output.	2025-12-28 22:07:47 -08:00
robobun	d04b86d34f	perf: use jsonStringifyFast for faster JSON serialization (#25733 ) ## Summary Apply the same optimization technique from PR #25717 (Response.json) to other APIs that use JSON.stringify internally: - IPC message serialization (`ipc.zig`) - used for inter-process communication - console.log with %j format (`ConsoleObject.zig`) - commonly used for debugging - PostgreSQL JSON/JSONB types (`PostgresRequest.zig`) - database operations - MySQL JSON type (`MySQLTypes.zig`) - database operations - Jest %j/%o format specifiers (`jest.zig`) - test output formatting - Transpiler tsconfig/macros (`JSTranspiler.zig`) - build configuration ### Root Cause When calling `JSONStringify(globalObject, value, 0)`, the space parameter `0` becomes `jsNumber(0)`, which is NOT `undefined`. This causes JSC's FastStringifier (SIMD-optimized) to bail out: ```cpp // In WebKit's JSONObject.cpp FastStringifier::stringify() if (!space.isUndefined()) { logOutcome("space"_s); return { }; // Bail out to slow path } ``` Using `jsonStringifyFast` which passes `jsUndefined()` triggers the fast path. ### Expected Performance Improvement Based on PR #25717 results, these changes should provide ~3x speedup for JSON serialization in the affected APIs. ## Test plan - [x] Debug build compiles successfully - [x] Basic functionality verified (IPC, console.log %j, Response.json) - [x] Existing tests pass 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com>	2025-12-28 18:01:07 -08:00
robobun	37fc8e99f7	Harden WebSocket client decompression (#25724 ) ## Summary - Add maximum decompressed message size limit to WebSocket client deflate handling - Add test coverage for decompression limits ## Test plan - Run `bun test test/js/web/websocket/websocket-permessage-deflate-edge-cases.test.ts` 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com>	2025-12-28 17:58:24 -08:00
robobun	6b5de25d8a	feat(shell): add $.trace for analyzing shell commands without execution (#25667 ) ## Summary Adds `Bun.$.trace` for tracing shell commands without executing them. ```js const result = $.trace`cat /tmp/file.txt > output.txt`; // { operations: [...], cwd: "...", success: true, error: null } ``` ## Test plan - [x] `bun bd test test/js/bun/shell/trace.test.ts` 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com>	2025-12-27 17:25:52 -08:00
Alex Miller	7b49654db6	fix(io): Prevent data corruption in `Bun.write` for files >2GB (#25720 ) Closes #8254 Fixes a data corruption bug in `Bun.write()` where files larger than 2GB would have chunks skipped resulting in corrupted output with missing data. The `doWriteLoop` had an issue where it would essentially end up offsetting twice every 2GB chunks: - it first sliced the buffer by `total_written`: ```remain = remain[@min(this.total_written, remain.len)..]``` - it would then increment `bytes_blob.offset`: `this.bytes_blob.offset += @truncate(wrote)` but because `sharedView()` already uses the blob offset `slice_ = slice_[this.offset..]` it would end up doubling the offset. In a local reproduction writing a 16GB file with each 2GB chunk filled with incrementing values `[1, 2, 3, 4, 5, 6, 7, 8]`, the buggy version produced: `[1, 3, 5, 7, …]`, skipping every other chunk. The fix is to simply remove the redundant manual offset and rely only on `total_written` to track write progress.	2025-12-27 16:58:36 -08:00
SUZUKI Sosuke	603bbd18a0	Enable `CHECK_REF_COUNTED_LIFECYCLE` in WebKit (#25705 ) ### What does this PR do? Enables `CHECK_REF_COUNTED_LIFECYCLE` in WebKit ( https://github.com/oven-sh/WebKit/pull/121 ) See also `a978fae619` #### `CHECK_REF_COUNTED_LIFECYCLE`? A compile-time macro that enables lifecycle validation for reference-counted objects in debug builds. Definition ```cpp #if ASSERT_ENABLED || ENABLE(SECURITY_ASSERTIONS) #define CHECK_REF_COUNTED_LIFECYCLE 1 #else #define CHECK_REF_COUNTED_LIFECYCLE 0 #endif ``` Purpose Detects three categories of bugs: 1. Missing adoption - Objects stored in RefPtr without using adoptRef() 2. Ref during destruction - ref() called while destructor is running (causes dangling pointers) 3. Thread safety violations - Unsafe ref/deref across threads Implementation When enabled, RefCountDebugger adds two tracking flags: - m_deletionHasBegun - Set when destructor starts - m_adoptionIsRequired - Cleared when adoptRef() is called These flags are checked on every ref()/deref() call, with assertions failing on violations. Motivation Refactored debug code into a separate RefCountDebugger class to: - Improve readability of core refcount logic - Eliminate duplicate code across RefCounted, ThreadSafeRefCounted, etc. - Simplify adding new refcount classes Overhead Zero in release builds - the flags and checks are compiled out entirely. ### How did you verify your code works? --------- Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-27 15:02:11 -08:00
robobun	1d7cb4bbad	perf(Response.json): use JSC's FastStringifier by passing undefined for space (#25717 ) ## Summary - Fix performance regression where `Response.json()` was 2-3x slower than `JSON.stringify() + new Response()` - Root cause: The existing code called `JSC::JSONStringify` with `indent=0`, which internally passes `jsNumber(0)` as the space parameter. This bypasses WebKit's FastStringifier optimization. - Fix: Add a new `jsonStringifyFast` binding that passes `jsUndefined()` for the space parameter, triggering JSC's FastStringifier (SIMD-optimized) code path. ## Root Cause Analysis In WebKit's `JSONObject.cpp`, the `stringify()` function has this logic: ```cpp static NEVER_INLINE String stringify(JSGlobalObject& globalObject, JSValue value, JSValue replacer, JSValue space) { // ... if (String result = FastStringifier<Latin1Character, BufferMode::StaticBuffer>::stringify(globalObject, value, replacer, space, failureReason); !result.isNull()) return result; // Falls back to slow Stringifier... } ``` And `FastStringifier::stringify()` checks: ```cpp if (!space.isUndefined()) { logOutcome("space"_s); return { }; // Bail out to slow path } ``` So when we called `JSONStringify(globalObject, value, (unsigned)0)`, it converted to `jsNumber(0)` which is NOT `undefined`, causing FastStringifier to bail out. 
## Performance Results ### Before (3.5x slower than manual approach) ``` Response.json(): 2415ms JSON.stringify() + Response(): 689ms Ratio: 3.50x ``` ### After (parity with manual approach) ``` Response.json(): ~700ms JSON.stringify() + Response(): ~700ms Ratio: ~1.09x ``` ## Test plan - [x] Existing `Response.json()` tests pass (`test/regression/issue/21257.test.ts`) - [x] Response tests pass (`test/js/web/fetch/response.test.ts`) - [x] Manual verification that output is correct for various JSON inputs Fixes #25693 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Sosuke Suzuki <sosuke@bun.com>	2025-12-27 15:01:28 -08:00
Oleksandr Herasymov	d3a5f2eef2	perf: speed up Bun.hash.crc32 by switching to zlib CRC32 (#25692 ) ## What does this PR do? Switch `Bun.hash.crc32` to use `zlib`'s CRC32 implementation. Bun already links `zlib`, which provides highly optimized, hardware-accelerated CRC32. Because `zlib.crc32` takes a 32-bit length, chunk large inputs to avoid truncation/regressions on buffers >4GB. Note: This was tried before (PR #12164 by Jarred), which switched CRC32 to zlib for speed. This proposal keeps that approach and adds explicit chunking to avoid the >4GB length pitfall. Problem: `Bun.hash.crc32` is a significant outlier in microbenchmarks compared to other hash functions (about 21x slower than `zlib.crc32` in a 1MB test on M1). Root cause: `Bun.hash.crc32` uses Zig's `std.hash.Crc32` implementation, which is software-only and does not leverage hardware acceleration (e.g., `PCLMULQDQ` on x86 or `CRC32` instructions on ARM). Implementation: https://github.com/oven-sh/bun/blob/main/src/bun.js/api/HashObject.zig ```zig pub const crc32 = hashWrap(struct { pub fn hash(seed: u32, bytes: []const u8) u32 { // zlib takes a 32-bit length, so chunk large inputs to avoid truncation. var crc: u64 = seed; var offset: usize = 0; while (offset < bytes.len) { const remaining = bytes.len - offset; const max_len: usize = std.math.maxInt(u32); const chunk_len: u32 = if (remaining > max_len) @intCast(max_len) else @intCast(remaining); crc = bun.zlib.crc32(crc, bytes.ptr + offset, chunk_len); offset += chunk_len; } return @intCast(crc); } }); ``` ### How did you verify your code works? Benchmark (1MB payload): - Before: Bun 1.3.5 `Bun.hash.crc32` = 2,644,444 ns/op vs `zlib.crc32` = 124,324 ns/op (~21x slower) - After (local bun-debug): `Bun.hash.crc32` = 360,591 ns/op vs `zlib.crc32` = 359,069 ns/op (~1.0x), results match ## Test environment - Machine: MacBook Pro 13" (M1, 2020) - OS: macOS 15.7.3 - Baseline Bun: 1.3.5 - After Bun: local `bun-debug` (build/debug)	2025-12-26 23:41:10 -08:00
robobun	b51e993bc2	fix: reject null bytes in spawn args, env, and shell arguments (#25698 ) ## Summary - Reject null bytes in command-line arguments passed to `Bun.spawn` and `Bun.spawnSync` - Reject null bytes in environment variable keys and values - Reject null bytes in shell (`$`) template literal arguments This prevents null byte injection attacks (CWE-158) where null bytes in strings could cause unintended truncation when passed to the OS, potentially allowing attackers to bypass file extension validation or create files with unexpected names. ## Test plan - [x] Added tests in `test/js/bun/spawn/null-byte-injection.test.ts` - [x] Tests pass with debug build: `bun bd test test/js/bun/spawn/null-byte-injection.test.ts` - [x] Tests fail with system Bun (confirming the fix works) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com>	2025-12-26 23:39:37 -08:00
Dylan Conway	d0bd1b121f	Fix DCE producing invalid syntax for empty objects in spreads (#25710 ) ## Summary - Fixes dead code elimination producing invalid syntax like `{ ...a, x: }` when simplifying empty objects in spread contexts - The issue was that `simplifyUnusedExpr` and `joinAllWithCommaCallback` could return `E.Missing` instead of `null` to indicate "no side effects" - Added checks to return `null` when the result is `E.Missing` Fixes #25609 ## Test plan - [x] Added regression test that fails on v1.3.5 and passes with fix - [x] `bun bd test test/regression/issue/25609.test.ts` passes 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude <noreply@anthropic.com>	2025-12-26 21:48:19 -08:00
Nico Cevallos	5715b54614	add test for dependency order when a package's name is larger than 8 characters + fix (#25697 ) ### What does this PR do? - Add a test that fails before the code changes, and fix the previous test by making the dependency's script take a bit of time to execute. Without the `setTimeout` in the tests, they always succeed because of a race condition. I tried adding a test combining both tests, with dependencies `dep0` and `larger-than-8-char`, but if the timeout is the same it succeeds. - Fix the added use case by using the correct buffer for `Dependency.name`; otherwise it reads garbage when the package name is larger than 8 characters. This should fix #12203 ### How did you verify your code works? Undo the code changes to verify that the new test fails, then check that it passes again after reapplying them.	2025-12-25 23:49:23 -08:00
SUZUKI Sosuke	699d8b1e1c	Upgrade WebKit Dec 24, 2025 (#25684 ) - WTFMove → WTF::move / std::move: Replaced WTFMove() macro with WTF::move() function for WTF types, std::move() for std types - SortedArrayMap removed: Replaced with if-else chains in EventFactory.cpp, JSCryptoKeyUsage.cpp - Wasm::Memory::create signature changed: Removed VM parameter - URLPattern allocation: Changed from WTF_MAKE_ISO_ALLOCATED to WTF_MAKE_TZONE_ALLOCATED --------- Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>	2025-12-25 14:00:58 -08:00
SUZUKI Sosuke	bffccf3d5f	Upgrade WebKit 2025/12/07 (#25429 ) Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com> Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com> Co-authored-by: Claude Bot <claude-bot@bun.sh>	2025-12-23 22:24:18 -08:00
robobun	8484e1b827	perf: shrink ConcurrentTask from 24 bytes to 16 bytes (#25636 )	2025-12-22 12:07:24 -08:00
robobun	3898ed5e3f	perf: pack boolean flags and reorder fields to reduce struct padding (#25627 )	2025-12-21 17:12:42 -08:00
Jarred Sumner	c08ffadf56	perf(linux): add memfd optimizations and typed flags (#25597 ) ## Summary - Add `MemfdFlags` enum to replace raw integer flags for `memfd_create`, providing semantic clarity for different use cases (`executable`, `non_executable`, `cross_process`) - Add support for `MFD_EXEC` and `MFD_NOEXEC_SEAL` flags (Linux 6.3+) with automatic fallback to older kernel flags when `EINVAL` is returned - Use memfd + `/proc/self/fd/{fd}` path for loading embedded `.node` files in standalone builds, avoiding disk writes entirely on Linux ## Test plan - [ ] Verify standalone builds with embedded `.node` files work on Linux - [ ] Verify fallback works on older kernels (pre-6.3) - [ ] Verify subprocess stdio memfd still works correctly 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-12-19 23:18:21 -08:00
Dylan Conway	fa983247b2	fix(create): crash when running postinstall task with --no-install (#25616 ) ## Summary - Fix segmentation fault in `bun create` when using `--no-install` with a template that has a `bun-create.postinstall` task starting with "bun " - The bug was caused by unconditionally slicing `argv[2..]` which created an empty array when `npm_client` was null - Added check for `npm_client != null` before slicing ## Reproduction ```bash # Create template with bun-create.postinstall mkdir -p ~/.bun-create/test-template echo '{"name":"test","bun-create":{"postinstall":"bun install"}}' > ~/.bun-create/test-template/package.json # This would crash before the fix bun create test-template /tmp/my-app --no-install ``` ## Test plan - [x] Verified the reproduction case crashes before the fix - [x] Verified the reproduction case works after the fix 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-19 23:17:51 -08:00
Dylan Conway	99b0a16c33	fix: prevent out-of-bounds access in NO_PROXY parsing (#25617 ) ## Summary - Fix out-of-bounds access when parsing `NO_PROXY` environment variable with empty entries - Empty entries (e.g., `"localhost, , example.com"`) would cause a panic when checking if the host starts with a dot - Skip empty entries after trimming whitespace fixes BUN-110G fixes BUN-128V ## Test plan - [x] Verify `NO_PROXY="localhost, , example.com"` no longer crashes 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-12-19 23:17:29 -08:00
Dylan Conway	085e25d5d1	fix: protect StringOrBuffer from GC in async operations (#25594 ) ## Summary - Fix use-after-free crash in async zstd compression, scrypt, and JSTranspiler operations - When `StringOrBuffer.fromJSMaybeAsync` is called with `is_async=true`, the buffer's JSValue is now protected from garbage collection - Previously, the buffer could be GC'd while a worker thread was still accessing it, causing segfaults in zstd's `HIST_count_simple` and similar functions Fixes BUN-167Z ## Changes - `fromJSMaybeAsync`: Call `protect()` on buffer when `is_async=true` - `fromJSWithEncodingMaybeAsync`: Same protection for the early return path - `Scrypt`: Fix cleanup to use `deinitAndUnprotect()` for async path, add missing `deinit()` in sync path - `JSTranspiler`: Use new protection mechanism instead of manual `protect()`/`unprotect()` calls - Simplify `createOnJSThread` signatures to not return errors (OOM is handled internally) - Update all callers to use renamed/simplified APIs ## Test plan - [x] Code review of all callsites to verify correct protect/unprotect pairing - [ ] Run existing zstd tests - [ ] Run existing scrypt tests - [ ] Run existing transpiler tests 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-19 17:30:26 -08:00
Jarred Sumner	ce5c336ea5	Revert "fix: memory leaks in IPC message handling (#25602 )" This reverts commit `05b12e0ed0`. The tests did not fail with the system version of Bun.	2025-12-19 17:28:54 -08:00
robobun	05b12e0ed0	fix: memory leaks in IPC message handling (#25602 ) ## Summary - Add periodic memory reclamation for IPC buffers after processing messages - Fix missing `deref()` on `bun.String` created from `cmd` property in `handleIPCMessage` - Add `reclaimMemory()` function to shrink incoming buffer and send queue when they exceed 2MB capacity - Track message count to trigger memory reclamation every 256 messages The incoming `ByteList` buffer and send queue `ArrayList` would grow but never shrink, causing memory accumulation during sustained IPC messaging. ## Test plan - [x] Added regression tests in `test/js/bun/spawn/spawn-ipc-memory.test.ts` - [x] Existing IPC tests pass (`spawn.ipc.test.ts`) - [x] Existing cluster tests pass 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com>	2025-12-19 17:27:09 -08:00
Angus Comrie	d9459f8540	Fix postgres empty check when handling arrays (#25607 ) ### What does this PR do? Closes #25505. This adjusts the byte length check in `DataCell: fromBytes` to 12 bytes instead of 16, as zero-dimensional arrays will have a shorter preamble. ### How did you verify your code works? Test suite passes, and I've added a new test that fails in the main branch but passes with this change. The issue only seems to crop up when a connection is _reused_, which is curious.	2025-12-19 14:49:12 -08:00
Jarred Sumner	e79b512a9d	Propagate debugger CLI config in single-file executables (#25600 )	2025-12-19 09:49:02 -08:00
robobun	9902039b1f	fix: memory leaks in error-handling code for Brotli, Zstd, and Zlib compression state machines (#25592 ) ## Summary Fix several memory leaks in the compression libraries: - NativeBrotli/NativeZstd reset() - Each call to `reset()` allocated a new encoder/decoder without freeing the previous one - NativeBrotli/NativeZstd init() error paths - If `setParams()` failed after `stream.init()` succeeded, the instance was leaked - NativeZstd init() - If `setPledgedSrcSize()` failed after context creation, the context was leaked - ZlibCompressorArrayList - After `deflateInit2_()` succeeded, if `ensureTotalCapacityPrecise()` failed with OOM, zlib internal state was never freed - NativeBrotli close() - Now sets state to null to prevent potential double-free (defensive) - LibdeflateState - Added `deinit()` for API consistency ## Test plan - [x] Added regression test that calls `reset()` 100k times and measures memory growth - [x] Test shows memory growth dropped from ~600MB to ~10MB for Brotli - [x] Verified no double-frees by tracing code paths - [x] Existing zlib tests pass (except pre-existing timeout in debug build) Before fix (system bun 1.3.3): ``` Memory growth after 100000 reset() calls: 624.38 MB (BrotliCompress) Memory growth after 100000 reset() calls: 540.63 MB (BrotliDecompress) ``` After fix: ``` Memory growth after 100000 reset() calls: 11.84 MB (BrotliCompress) Memory growth after 100000 reset() calls: 0.16 MB (BrotliDecompress) ``` 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com>	2025-12-18 21:42:14 -08:00
Dylan Conway	f3fd7506ef	fix(windows): handle UV_UNKNOWN and UV_EAI_* error codes in libuv errno mapping (#25596 ) ## Summary - Add missing `UV_UNKNOWN` and `UV_EAI_*` error code mappings to the `errno()` function in `ReturnCode` - Fixes panic "integer does not fit in destination type" on Windows when libuv returns unmapped error codes - Speculative fix for BUN-131E ## Root Cause The `errno()` function was missing mappings for `UV_UNKNOWN` (-4094) and all `UV_EAI_*` address info errors (-3000 to -3014). When libuv returned these codes, the switch fell through to `else => null`, and the caller at `sys_uv.zig:317` assumed success and tried to cast the negative return code to `usize`, causing a panic. This was triggered in `readFileWithOptions` -> `preadv` when: - Memory-mapped file operations encounter exceptions (file modified/truncated by another process, network drive issues) - Windows returns error codes that libuv cannot map to standard errno values ## Crash Report ``` Bun v1.3.5 (`1e86ceb`) on windows x86_64baseline [] panic: integer does not fit in destination type sys_uv.zig:294: preadv node_fs.zig:5039: readFileWithOptions ``` ## Test plan - [ ] This fix prevents a panic, converting it to a proper error. Testing would require triggering `UV_UNKNOWN` from libuv, which is difficult to do reliably (requires memory-mapped file exceptions or unusual Windows errors). 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-18 21:41:41 -08:00
Dylan Conway	57cbbc09e4	fix: correct off-by-one bounds checks in bundler and package installer (#25582 ) ## Summary - Fix two off-by-one bounds check errors that used `>` instead of `>=` - Both bugs could cause undefined behavior (array out-of-bounds access) when an index equals the array length ## The Bugs ### 1. `src/install/postinstall_optimizer.zig:62` ```zig // Before (buggy): if (resolution > metas.len) continue; const meta: *const Meta = &metas[resolution]; // Out-of-bounds when resolution == metas.len // After (fixed): if (resolution >= metas.len) continue; ``` ### 2. `src/bundler/linker_context/doStep5.zig:10` ```zig // Before (buggy): if (id > c.graph.meta.len) return; const resolved_exports = &c.graph.meta.items(.resolved_exports)[id]; // Out-of-bounds when id == c.graph.meta.len // After (fixed): if (id >= c.graph.meta.len) return; ``` ## Why These Are Bugs Valid array indices are `0` to `len - 1`. When `index == len`: - `index > len` evaluates to `false` → check passes - `array[index]` accesses `array[len]` → out-of-bounds / undefined behavior ## Codebase Patterns The rest of the codebase correctly uses `>=` for these checks: - `lockfile.zig:484`: `if (old_resolution >= old.packages.len) continue;` - `lockfile.zig:522`: `if (old_resolution >= old.packages.len) continue;` - `LinkerContext.zig:389`: `if (source_index >= import_records_list.len) continue;` - `LinkerContext.zig:1667`: `if (source_index >= c.graph.ast.len) {` ## Test plan - [x] Verified fix aligns with existing codebase patterns - [ ] CI passes 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude <noreply@anthropic.com>	2025-12-18 18:04:28 -08:00
Dylan Conway	8941a363c3	fix: dupe `ca` string in .npmrc to prevent use-after-free (#25563 ) ## Summary - Fix use-after-free bug when parsing `ca` option from `.npmrc` - The `ca` string was being stored directly from the parser's arena without duplication - Since the parser arena is freed at the end of `loadNpmrc`, this created a dangling pointer ## The Bug In `src/ini.zig`, the `ca` string wasn't being duplicated like all other string properties: ```zig // Lines 983-986 explicitly warn about this: // Need to be very, very careful here with strings. // They are allocated in the Parser's arena, which of course gets // deinitialized at the end of the scope. // We need to dupe all strings // Line 981: Parser arena is freed here defer parser.deinit(); // Line 1016-1020: THE BUG - string not duped! if (out.asProperty("ca")) |query| { if (query.expr.asUtf8StringLiteral()) |str| { install.ca = .{ .str = str, // ← Dangling pointer after parser.deinit()! }; ``` All other string properties in the same function correctly duplicate: - `registry` (line 996): `try allocator.dupe(u8, str)` - `cache` (line 1002): `try allocator.dupe(u8, str)` - `cafile` (line 1037): `asStringCloned(allocator)` - `ca` array items (line 1026): `asStringCloned(allocator)` ## User Impact When a user has `ca=<certificate>` in their `.npmrc` file: 1. The certificate string is parsed and stored 2. The parser arena is freed 3. `install.ca.str` becomes a dangling pointer 4. Later TLS/SSL operations access freed memory 5. Could cause crashes, undefined behavior, or security issues ## Test plan - Code inspection confirms this matches the pattern used for all other string properties - The fix adds `try allocator.dupe(u8, str)` to match `cache`, `registry`, etc. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude <noreply@anthropic.com>	2025-12-17 19:56:25 -08:00
Dylan Conway	722ac3aa5a	fix: check correct variable in subprocess stdin cleanup (#25562 ) ## Summary - Fix typo in `onProcessExit` where `existing_stdin_value.isCell()` was checked instead of `existing_value.isCell()` - Since `existing_stdin_value` is always `.zero` at that point, the condition was always false, making the inner block dead code ## The Bug In `src/bun.js/api/bun/subprocess.zig:593`: ```zig var existing_stdin_value = jsc.JSValue.zero; // Line 590 - always .zero if (this_jsvalue != .zero) { if (jsc.Codegen.JSSubprocess.stdinGetCached(this_jsvalue)) |existing_value| { if (existing_stdin_value.isCell()) { // BUG! Should be existing_value // This block was DEAD CODE - never executed ``` Compare with the correct pattern used elsewhere: ```zig // shell/subproc.zig:251-252 (CORRECT) if (jsc.Codegen.JSSubprocess.stdinGetCached(subprocess.this_jsvalue)) |existing_value| { jsc.WebCore.FileSink.JSSink.setDestroyCallback(existing_value, 0); // Uses existing_value } ``` ## Impact The dead code prevented: - Recovery of stdin from cached JS value when `weak_file_sink_stdin_ptr` is null - Proper cleanup via `onAttachedProcessExit` on the FileSink - `setDestroyCallback` cleanup in `onProcessExit` Note: The user-visible impact was mitigated by redundant cleanup paths in `Writable.zig` that also call `setDestroyCallback`. ## Test plan - Code inspection confirms this is a straightforward typo fix - Existing subprocess tests continue to pass 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude <noreply@anthropic.com>	2025-12-17 18:34:58 -08:00
Dylan Conway	a333d02f84	fix: correct inverted buffer allocation logic in Postgres array parsing (#25564 ) ## Summary - Fix inverted buffer allocation logic when parsing strings in Postgres arrays - Strings larger than 16KB were incorrectly using the stack buffer instead of dynamically allocating - This caused spurious `InvalidByteSequence` errors for valid data ## The Bug In `src/sql/postgres/DataCell.zig`, the condition for when to use dynamic allocation was inverted: ```zig // BEFORE (buggy): const needs_dynamic_buffer = str_bytes.len < stack_buffer.len; // TRUE when SMALL // AFTER (fixed): const needs_dynamic_buffer = str_bytes.len > stack_buffer.len; // TRUE when LARGE ``` ## What happened with large strings (>16KB): 1. `needs_dynamic_buffer` = false (e.g., 20000 < 16384 is false) 2. Uses `stack_buffer[0..]` which is only 16KB 3. `unescapePostgresString` hits bounds check and returns `BufferTooSmall` 4. Error converted to `InvalidByteSequence` 5. User gets error even though data is valid ## User Impact Users with Postgres arrays containing JSON or string elements larger than 16KB would get spurious InvalidByteSequence errors even though their data was perfectly valid. ## Test plan - Code inspection confirms the logic was inverted - The fix aligns with the intended behavior: use stack buffer for small strings, dynamic allocation for large strings 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude <noreply@anthropic.com>	2025-12-17 18:34:17 -08:00
Dylan Conway	c1acb0b9a4	fix(shell): prevent double-close of fd when using &> redirect with builtins (#25568 ) ## Summary - Fix double-close of file descriptor when using `&>` redirect with shell builtin commands - Add `dupeRef()` helper for cleaner reference counting semantics - Add tests for `&>` and `&>>` redirects with builtins ## Test plan - [x] Added tests in `test/js/bun/shell/file-io.test.ts` that reproduce the bug - [x] All file-io tests pass ## The Bug When using `&>` to redirect both stdout and stderr to the same file with a shell builtin command (e.g., `pwd &> file.txt`), the code was creating two separate `IOWriter` instances that shared the same file descriptor. When both `IOWriter`s were destroyed, they both tried to close the same fd, causing an `EBADF` (bad file descriptor) error. ```javascript import { $ } from "bun"; await $`pwd &> output.txt`; // Would crash with EBADF ``` ## The Fix 1. Share a single `IOWriter` between stdout and stderr when both are redirected to the same file, with proper reference counting 2. Rename `refSelf` to `dupeRef` for clarity across `IOReader`, `IOWriter`, `CowFd`, and add it to `Blob` for consistency 3. Fix the `Body.Value` blob case to also properly reference count when the same blob is assigned to multiple outputs 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Latest model <noreply@anthropic.com>	2025-12-17 18:33:53 -08:00
190n	fa5a5bbe55	fix: v8::Value::IsInt32()/IsUint32() edge cases (#25548 ) ### What does this PR do? - fixes both functions returning false for double-encoded values (even if the numeric value is a valid int32/uint32) - fixes IsUint32() returning false for values that don't fit in int32 - fixes the test from #22462 not testing anything (the native functions were being passed a callback to run garbage collection as the first argument, so it was only ever testing what the type check APIs returned for that function) - extends the test to cover the first edge case above ### How did you verify your code works? The new tests fail without these fixes. --------- Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>	2025-12-17 00:52:16 -08:00
robobun	bc47f87450	fix(ini): support env var expansion in quoted .npmrc values (#25518 ) ## Summary Fixes environment variable expansion in quoted `.npmrc` values and adds support for the `?` optional modifier. ### Changes Simplified quoted value handling: - Removed unnecessary `isProperlyQuoted` check that added complexity without benefit - When JSON.parse succeeds for quoted strings, expand env vars in the result - When JSON.parse fails for single-quoted strings like `'${VAR}'`, still expand env vars Added `?` modifier support (matching npm behavior): - `${VAR}` - if VAR is undefined, leaves as `${VAR}` (no expansion) - `${VAR?}` - if VAR is undefined, expands to empty string This applies consistently to both quoted and unquoted values. ### Examples ```ini # Env var found - all expand to the value token = ${NPM_TOKEN} token = "${NPM_TOKEN}" token = '${NPM_TOKEN}' # Env var NOT found - left as-is token = ${NPM_TOKEN} # → ${NPM_TOKEN} token = "${NPM_TOKEN}" # → ${NPM_TOKEN} token = '${NPM_TOKEN}' # → ${NPM_TOKEN} # Optional modifier (?) - expands to empty if not found token = ${NPM_TOKEN?} # → (empty) token = "${NPM_TOKEN?}" # → (empty) auth = "Bearer ${TOKEN?}" # → Bearer ``` ### Test Plan - Added 8 new tests for the `?` modifier covering quoted and unquoted values - Verified all expected values match `npm config get` behavior - All 30 ini tests pass 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com> Co-authored-by: Dylan Conway <dylan.conway567@gmail.com>	2025-12-16 19:49:23 -08:00
robobun	b135c207ed	fix(yaml): remove YAML 1.1 legacy boolean values for YAML 1.2 compliance (#25537) ## Summary - Remove YAML 1.1 legacy boolean values (`yes/no/on/off/y/Y`) that are not part of the YAML 1.2 Core Schema - Keep YAML 1.2 Core Schema compliant values: `true/True/TRUE`, `false/False/FALSE`, `null/Null/NULL`, `0x` hex, `0o` octal - Add comprehensive roundtrip tests for YAML 1.2 compliance Removed (now parsed as strings): - `yes`, `Yes`, `YES` (were `true`) - `no`, `No`, `NO` (were `false`) - `on`, `On`, `ON` (were `true`) - `off`, `Off`, `OFF` (were `false`) - `y`, `Y` (were `true`) This fixes a common pain point where GitHub Actions workflow files with `on:` keys would have the key parsed as boolean `true` instead of the string `"on"`. ## YAML 1.2 Core Schema Specification From [YAML 1.2.2 Section 10.3.2 Tag Resolution](https://yaml.org/spec/1.2.2/#1032-tag-resolution):

| Regular expression | Resolved to tag |
|-------------------|-----------------|
| `null \| Null \| NULL \| ~` | tag:yaml.org,2002:null |
| `/* Empty */` | tag:yaml.org,2002:null |
| `true \| True \| TRUE \| false \| False \| FALSE` | tag:yaml.org,2002:bool |
| `[-+]? [0-9]+` | tag:yaml.org,2002:int (Base 10) |
| `0o [0-7]+` | tag:yaml.org,2002:int (Base 8) |
| `0x [0-9a-fA-F]+` | tag:yaml.org,2002:int (Base 16) |
| `[-+]? ( \. [0-9]+ \| [0-9]+ ( \. [0-9]* )? ) ( [eE] [-+]? [0-9]+ )?` | tag:yaml.org,2002:float |
| `[-+]? ( \.inf \| \.Inf \| \.INF )` | tag:yaml.org,2002:float (Infinity) |
| `\.nan \| \.NaN \| \.NAN` | tag:yaml.org,2002:float (Not a number) |

Note: `yes`, `no`, `on`, `off`, `y`, `n` are not in the YAML 1.2 Core Schema boolean list. These YAML 1.1 values were dropped in YAML 1.2, as noted in [YAML 1.2 Section 1.2](https://yaml.org/spec/1.2.2/#12-yaml-history): > The YAML 1.2 specification was published in 2009. Its primary focus was making YAML a strict superset of JSON. It also removed many of the problematic implicit typing recommendations.
## Test plan - [x] Updated existing YAML tests to reflect YAML 1.2 Core Schema behavior - [x] Added roundtrip tests (stringify → parse) for YAML 1.2 compliance - [x] Verified tests fail with system Bun (YAML 1.1 behavior) and pass with debug build (YAML 1.2) - [x] Run `bun bd test test/js/bun/yaml/yaml.test.ts` 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com>	2025-12-16 14:29:39 -08:00
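The tag-resolution table quoted in this commit message translates fairly directly into code. Below is a rough TypeScript sketch of Core Schema resolution for plain (unquoted) scalars, assuming the regular expressions from YAML 1.2.2 Section 10.3.2. The names `CoreTag` and `resolveCoreSchemaTag` are hypothetical, and this is not how Bun's YAML parser is structured.

```typescript
// Hypothetical sketch of YAML 1.2 Core Schema tag resolution for plain scalars.
type CoreTag = "null" | "bool" | "int" | "float" | "str";

function resolveCoreSchemaTag(scalar: string): CoreTag {
  if (scalar === "" || /^(null|Null|NULL|~)$/.test(scalar)) return "null";
  if (/^(true|True|TRUE|false|False|FALSE)$/.test(scalar)) return "bool";
  if (/^[-+]?[0-9]+$/.test(scalar)) return "int"; // base 10
  if (/^0o[0-7]+$/.test(scalar)) return "int"; // base 8
  if (/^0x[0-9a-fA-F]+$/.test(scalar)) return "int"; // base 16
  if (/^[-+]?(\.[0-9]+|[0-9]+(\.[0-9]*)?)([eE][-+]?[0-9]+)?$/.test(scalar)) return "float";
  if (/^[-+]?\.(inf|Inf|INF)$/.test(scalar)) return "float"; // infinity
  if (/^\.(nan|NaN|NAN)$/.test(scalar)) return "float"; // not a number
  // Everything else, including the YAML 1.1 booleans, stays a string.
  return "str";
}
```

Under these rules `resolveCoreSchemaTag("on")` returns `"str"`, which is why an `on:` key in a GitHub Actions workflow is read as the string `"on"` rather than `true`.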
Ciro Spaciari	a1dd26d7db	fix(usockets) fix last_write_failed flag (#25496 ) https://github.com/oven-sh/bun/pull/25361 needs to be merged before this PR ## Summary - Move `last_write_failed` flag from loop-level to per-socket flag for correctness ## Changes - Move `last_write_failed` from `loop->data` to `socket->flags` - More semantically correct since write status is per-socket, not per-loop 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-16 14:26:42 -08:00
robobun	dd04c57258	feat: implement V8 Value type checking APIs (#22462 ) ## Summary This PR implements four V8 C++ API methods for type checking that are commonly used by native Node.js modules: - `v8::Value::IsMap()` - checks if value is a Map - `v8::Value::IsArray()` - checks if value is an Array - `v8::Value::IsInt32()` - checks if value is a 32-bit integer - `v8::Value::IsBigInt()` - checks if value is a BigInt ## Implementation Details The implementation maps V8's type checking APIs to JavaScriptCore's equivalent functionality: - `IsMap()` uses JSC's `inherits<JSC::JSMap>()` check - `IsArray()` uses JSC's `isArray()` function with the global object - `IsInt32()` uses JSC's `isInt32()` method - `IsBigInt()` uses JSC's `isBigInt()` method ## Changes - Added method declarations to `V8Value.h` - Implemented the methods in `V8Value.cpp` - Added symbol exports to `napi.zig` (both Unix and Windows mangled names) - Added symbols to `symbols.txt` and `symbols.dyn` - Added comprehensive tests in `v8-module/main.cpp` and `v8.test.ts` ## Testing The implementation has been verified to: - Compile successfully without errors - Export the correct symbols in the binary - Follow established patterns in the V8 compatibility layer Tests cover various value types including empty and populated Maps/Arrays, different numeric ranges, BigInts, and other JavaScript types. 🤖 Generated with [Claude Code](https://claude.ai/code) --------- Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com>	2025-12-15 19:50:11 -08:00
robobun	344b2c1dfe	fix: Response.clone() no longer locks body when body was accessed before clone (#25484 ) ## Summary - Fix bug where `Response.clone()` would lock the original response's body when `response.body` was accessed before cloning - Apply the same fix to `Request.clone()` ## Root Cause When `response.body` was accessed before calling `response.clone()`, the original response's body would become locked after cloning. This happened because: 1. When the cloned response was wrapped with `toJS()`, `checkBodyStreamRef()` was called which moved the stream from `Locked.readable` to `js.gc.stream` and cleared `Locked.readable` 2. The subsequent code tried to get the stream from `Locked.readable`, which was now empty, so the body cache update was skipped 3. The JavaScript-level body property cache still held the old locked stream ## Fix Updated the cache update logic to: 1. For the cloned response: use `js.gc.stream.get()` instead of `Locked.readable.get()` since `toJS()` already moved the stream 2. 
For the original response: use `Locked.readable.get()` which still holds the teed stream since `checkBodyStreamRef` hasn't been called yet ## Reproduction ```javascript const readableStream = new ReadableStream({ start(controller) { controller.enqueue(new TextEncoder().encode("Hello, world!")); controller.close(); }, }); const response = new Response(readableStream); console.log(response.body?.locked); // Accessing body before clone const cloned = response.clone(); console.log(response.body?.locked); // Expected: false, Actual: true ❌ console.log(cloned.body?.locked); // Expected: false, Actual: false ✅ ``` ## Test plan - [x] Added regression tests for `Response.clone()` in `test/js/web/fetch/response.test.ts` - [x] Added regression test for `Request.clone()` in `test/js/web/request/request.test.ts` - [x] Verified tests fail with system bun (before fix) and pass with debug build (after fix) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com>	2025-12-15 18:46:02 -08:00
Ciro Spaciari	aef0b5b4a6	fix(usockets): safely handle socket reallocation during context adoption (#25361 ) ## Summary - Fix use-after-free vulnerability during socket adoption by properly tracking reallocated sockets - Add safety checks to prevent linking closed sockets to context lists - Properly track socket state with new `is_closed`, `adopted`, and `is_tls` flags ## What does this PR do? This PR improves event loop stability by addressing potential use-after-free issues that can occur when sockets are reallocated during adoption (e.g., when upgrading a TCP socket to TLS). ### Key Changes Socket State Tracking ([internal.h](packages/bun-usockets/src/internal/internal.h)) - Added `is_closed` flag to explicitly track when a socket has been closed - Added `adopted` flag to mark sockets that were reallocated during context adoption - Added `is_tls` flag to track TLS socket state for proper low-priority queue handling Safe Socket Adoption ([context.c](packages/bun-usockets/src/context.c)) - When `us_poll_resize()` returns a new pointer (reallocation occurred), the old socket is now: - Marked as closed (`is_closed = 1`) - Added to the closed socket cleanup list - Marked as adopted (`adopted = 1`) - Has its `prev` pointer set to the new socket for event redirection - Added guards to `us_internal_socket_context_link_socket/listen_socket/connecting_socket` to prevent linking already-closed sockets Event Loop Handling ([loop.c](packages/bun-usockets/src/loop.c)) - After callbacks that can trigger socket adoption (`on_open`, `on_writable`, `on_data`), the event loop now checks if the socket was reallocated and redirects to the new socket - Low-priority socket handling now properly checks `is_closed` state and uses `is_tls` flag for correct SSL handling Poll Resize Safety ([epoll_kqueue.c](packages/bun-usockets/src/eventing/epoll_kqueue.c)) - Changed `us_poll_resize()` to always allocate new memory with `us_calloc()` instead of `us_realloc()` to ensure the old pointer 
remains valid for cleanup - Now takes `old_ext_size` parameter to correctly calculate memory sizes - Re-enabled `us_internal_loop_update_pending_ready_polls()` call in `us_poll_change()` to ensure pending events are properly redirected ### How did you verify your code works? Run existing CI and existing socket upgrade tests under asan build	2025-12-15 18:43:51 -08:00
robobun	740fb23315	fix(windows): improve bunx metadata validation (#25012 ) ## Summary - Improved validation for bunx metadata files on Windows - Added graceful error handling for malformed metadata instead of crashing - Added regression test for the fix ## Test plan - [x] Run `bun bd test test/cli/install/bunx.test.ts -t "should not crash on corrupted"` - [x] Manual testing with corrupted `.bunx` files - [x] Verified normal operation still works 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com>	2025-12-15 18:37:09 -08:00
robobun	2dd997c4b5	fix(node): support duplicate dlopen calls with DLHandleMap (#24404) ## Summary Fixes an issue where loading the same native module (NODE_MODULE_CONTEXT_AWARE) multiple times would fail with: ``` symbol 'napi_register_module_v1' not found in native module ``` Fixes https://github.com/oven-sh/bun/issues/23136 Fixes https://github.com/oven-sh/bun/issues/21432 ## Root Cause When a native module is loaded for the first time: 1. `dlopen()` loads the shared library 2. Static constructors run and call `node_module_register()` 3. The module registers successfully On subsequent loads of the same module: 1. `dlopen()` returns the same handle (library already loaded) 2. Static constructors do not run again 3. No registration occurs, leading to the "symbol not found" error ## Solution Implemented a thread-safe `DLHandleMap` to cache and replay module registrations: 1. Thread-local storage captures the `node_module` during static constructor execution 2. After successful first load, save the registration to the global map 3. On subsequent loads, look up the cached registration and replay it This approach matches Node.js's `global_handle_map` implementation. ## Changes - Created `src/bun.js/bindings/DLHandleMap.h` - thread-safe singleton cache - Added thread-local storage in `src/bun.js/bindings/v8/node.cpp` - Modified `src/bun.js/bindings/BunProcess.cpp` to save/lookup cached modules - Also includes the exports fix (using `toObject()` to match Node.js behavior) ## Test Plan Added `test/js/node/process/dlopen-duplicate-load.test.ts` with tests that: - Build a native addon using node-gyp - Load it twice with `process.dlopen` - Verify both loads succeed - Test with different exports objects All tests pass. ## Related Issue Fixes the second bug discovered in the segfault investigation.
--------- Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com>	2025-12-15 18:35:26 -08:00
robobun	4061e1cb4f	fix: handle EINVAL from copy_file_range on eCryptfs (#25534 ) ## Summary - Add `EINVAL` and `OPNOTSUPP` to the list of errors that trigger fallback from `copy_file_range` to `sendfile`/read-write loop - Fixes `Bun.write` and `fs.copyFile` failing on eCryptfs filesystems ## Test plan - [x] Existing `copyFile` tests pass (`bun bd test test/js/node/fs/fs.test.ts -t "copyFile"`) - [x] Existing `copy_file_range` fallback tests pass (`bun bd test test/js/bun/io/bun-write.test.js -t "should work when copyFileRange is not available"`) Fixes #13968 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com>	2025-12-15 17:47:08 -08:00
robobun	6386eef8aa	fix(bunx): handle empty string arguments on Windows (#25025 ) ## Summary Fixes #13316 Fixes #18275 Running `bunx cowsay ""` (or any package with an empty string argument) on Windows caused a panic. Additionally, `bunx concurrently "command with spaces"` was splitting quoted arguments incorrectly. Repro #13316: ```bash bunx cowsay "" # panic(main thread): reached unreachable code ``` Repro #18275: ```bash bunx concurrently "bun --version" "bun --version" # Only runs once, arguments split incorrectly # Expected: ["bun --version", "bun --version"] # Actual: ["bun", "--version", "bun", "--version"] ``` ## Root Cause The bunx fast path on Windows bypasses libuv and calls `CreateProcessW` directly to save 5-12ms. The command line building logic had two issues: 1. Empty strings: Not quoted at all, resulting in invalid command line 2. Arguments with spaces: Not quoted, causing them to be split into multiple arguments ## Solution Implement Windows command-line argument quoting using libuv's proven algorithm: - Port of libuv's `quote_cmd_arg` function (process backwards + reverse) - Empty strings become `""` - Strings with spaces/tabs/quotes are wrapped in quotes - Backslashes before quotes are properly escaped per Windows rules Why not use libuv directly? 
- Normal `Bun.spawn()` uses `uv_spawn()` which handles quoting internally - bunx fast path bypasses libuv to save 5-12ms (calls `CreateProcessW` directly) - libuv's `quote_cmd_arg` is a static function (not exported) - Solution: port the algorithm to Zig ## Test Plan - [x] Added regression test for empty strings (#13316) - [x] Added regression test for arguments with spaces (#18275) - [x] Verified system bun (v1.3.3) fails both tests - [x] Verified fix passes both tests - [x] Implementation based on battle-tested libuv algorithm 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>	2025-12-15 17:29:04 -08:00
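The quoting rules summarized in this commit (empty strings become `""`, arguments with spaces, tabs, or quotes are wrapped, backslashes before quotes are doubled) can be sketched in TypeScript. This follows my reading of libuv's `quote_cmd_arg` approach of walking the argument backwards and then reversing; it is not Bun's Zig port, and the names `quoteWindowsArg` and `buildCommandLine` are hypothetical.

```typescript
// Hypothetical sketch of Windows command-line argument quoting,
// modeled on the backwards-then-reverse approach described above.
function quoteWindowsArg(arg: string): string {
  if (arg.length === 0) return '""'; // empty argument must still be visible
  if (!/[ \t"]/.test(arg)) return arg; // nothing that needs quoting
  if (!/["\\]/.test(arg)) return `"${arg}"`; // no quotes or backslashes: just wrap

  const reversed: string[] = [];
  // True while we are inside a run of backslashes that directly precedes
  // a quote (or the closing quote we add at the end).
  let quoteHit = true;
  for (let i = arg.length - 1; i >= 0; i--) {
    const ch = arg.charAt(i);
    reversed.push(ch);
    if (quoteHit && ch === "\\") {
      reversed.push("\\"); // double backslashes that precede a quote
    } else if (ch === '"') {
      quoteHit = true;
      reversed.push("\\"); // escape the quote itself
    } else {
      quoteHit = false;
    }
  }
  return `"${reversed.reverse().join("")}"`;
}

// Joins already-split arguments into a single command line string.
function buildCommandLine(args: string[]): string {
  return args.map(quoteWindowsArg).join(" ");
}
```

For the second repro above, `buildCommandLine(["concurrently", "bun --version", "bun --version"])` keeps each `bun --version` as one quoted argument instead of splitting it in two.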
robobun	3394fd3bdd	fix(node:url): return empty string for invalid domains in domainToASCII/domainToUnicode (#25196 ) ## Summary - Fixes `url.domainToASCII` and `url.domainToUnicode` to return empty string instead of throwing `TypeError` when given invalid domains - Per Node.js docs: "if `domain` is an invalid domain, the empty string is returned" ## Test plan - [x] Run `bun bd test test/regression/issue/24191.test.ts` - all 2 tests pass - [x] Verify tests fail with system Bun (`USE_SYSTEM_BUN=1`) to confirm fix validity - [x] Manual verification: `url.domainToASCII('xn--iñvalid.com')` returns `""` ## Example Before (bug): ``` $ bun -e "import url from 'node:url'; console.log(url.domainToASCII('xn--iñvalid.com'))" TypeError: domainToASCII failed ``` After (fixed): ``` $ bun -e "import url from 'node:url'; console.log(url.domainToASCII('xn--iñvalid.com'))" (empty string output) ``` Closes #24191 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com>	2025-12-15 17:26:32 -08:00
robobun	8dc79641c8	fix(http): support proxy passwords longer than 4096 characters (#25530 ) ## Summary - Fixes silent 401 Unauthorized errors when using proxies with long passwords (e.g., JWT tokens > 4096 chars) - Bun was silently dropping proxy passwords exceeding 4095 characters, falling through to code that only encoded the username ## Changes - Added `PercentEncoding.decodeWithFallback` which uses a 4KB stack buffer for the common case and falls back to heap allocation only for larger inputs - Updated proxy auth encoding in `AsyncHTTP.zig` to use the new fallback method ## Test plan - [x] Added test case that verifies passwords > 4096 chars are handled correctly - [x] Test fails with system bun (v1.3.3), passes with this fix - [x] All 29 proxy tests pass 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com>	2025-12-15 13:21:41 -08:00
robobun	d865ef41e2	feat: add Bun.Terminal API for pseudo-terminal (PTY) support (#25415) ## Summary This PR adds a new `Bun.Terminal` API for creating and managing pseudo-terminals (PTYs), enabling interactive terminal applications in Bun. ### Features - Standalone Terminal: Create PTYs directly with `new Bun.Terminal(options)` - Spawn Integration: Spawn processes with PTY attached via `Bun.spawn({ terminal: options })` - Full PTY Control: Write data, resize, set raw mode, and handle callbacks ## Examples ### Basic Terminal with Spawn (Recommended) ```typescript const proc = Bun.spawn(["bash"], { terminal: { cols: 80, rows: 24, data(terminal, data) { // Handle output from the terminal process.stdout.write(data); }, exit(terminal, code, signal) { console.log(`Process exited with code ${code}`); }, }, }); // Write commands to the terminal proc.terminal.write("echo Hello from PTY!\n"); proc.terminal.write("exit\n"); await proc.exited; proc.terminal.close(); ``` ### Interactive Shell ```typescript // Create an interactive shell that mirrors to stdout const proc = Bun.spawn(["bash", "-i"], { terminal: { cols: process.stdout.columns || 80, rows: process.stdout.rows || 24, data(term, data) { process.stdout.write(data); }, }, }); // Forward stdin to the terminal process.stdin.setRawMode(true); for await (const chunk of process.stdin) { proc.terminal.write(chunk); } ``` ### Running Interactive Programs (vim, htop, etc.) 
```typescript const proc = Bun.spawn(["vim", "file.txt"], { terminal: { cols: process.stdout.columns, rows: process.stdout.rows, data(term, data) { process.stdout.write(data); }, }, }); // Handle terminal resize process.stdout.on("resize", () => { proc.terminal.resize(process.stdout.columns, process.stdout.rows); }); // Forward input process.stdin.setRawMode(true); for await (const chunk of process.stdin) { proc.terminal.write(chunk); } ``` ### Capturing Colored Output ```typescript const chunks: Uint8Array[] = []; const proc = Bun.spawn(["ls", "--color=always"], { terminal: { data(term, data) { chunks.push(data); }, }, }); await proc.exited; proc.terminal.close(); // Output includes ANSI color codes const output = Buffer.concat(chunks).toString(); console.log(output); ``` ### Standalone Terminal (Advanced) ```typescript const terminal = new Bun.Terminal({ cols: 80, rows: 24, data(term, data) { console.log("Received:", data.toString()); }, }); // Use terminal.stdin as the fd for child process stdio const proc = Bun.spawn(["bash"], { stdin: terminal.stdin, stdout: terminal.stdin, stderr: terminal.stdin, }); terminal.write("echo hello\n"); // Clean up terminal.close(); ``` ### Testing TTY Detection ```typescript const proc = Bun.spawn([ "bun", "-e", "console.log('isTTY:', process.stdout.isTTY)" ], { terminal: {}, }); // Output: isTTY: true ``` ## API ### `Bun.spawn()` with `terminal` option ```typescript const proc = Bun.spawn(cmd, { terminal: { cols?: number, // Default: 80 rows?: number, // Default: 24 name?: string, // Default: "xterm-256color" data?: (terminal: Terminal, data: Uint8Array) => void, exit?: (terminal: Terminal, code: number, signal: string | null) => void, drain?: (terminal: Terminal) => void, } }); // Access the terminal proc.terminal.write(data); proc.terminal.resize(cols, rows); proc.terminal.setRawMode(enabled); proc.terminal.close(); // Note: proc.stdin, proc.stdout, proc.stderr return null when terminal is used ``` ### `new 
Bun.Terminal(options)` ```typescript const terminal = new Bun.Terminal({ cols?: number, rows?: number, name?: string, data?: (terminal, data) => void, exit?: (terminal, code, signal) => void, drain?: (terminal) => void, }); terminal.stdin; // Slave fd (for child process) terminal.stdout; // Master fd (for reading) terminal.closed; // boolean terminal.write(data); terminal.resize(cols, rows); terminal.setRawMode(enabled); terminal.ref(); terminal.unref(); terminal.close(); await terminal[Symbol.asyncDispose](); ``` ## Implementation Details - Uses `openpty()` to create pseudo-terminal pairs - Properly manages file descriptor lifecycle with reference counting - Integrates with Bun's event loop via `BufferedReader` and `StreamingWriter` - Supports `await using` syntax for automatic cleanup - POSIX only (Linux, macOS) - not available on Windows ## Test Results - 80 tests passing - Covers: construction, writing, reading, resize, raw mode, callbacks, spawn integration, error handling, GC safety ## Changes - `src/bun.js/api/bun/Terminal.zig` - Terminal implementation - `src/bun.js/api/bun/Terminal.classes.ts` - Class definition for codegen - `src/bun.js/api/bun/subprocess.zig` - Added terminal field and getter - `src/bun.js/api/bun/js_bun_spawn_bindings.zig` - Terminal option parsing - `src/bun.js/api/BunObject.classes.ts` - Terminal getter on Subprocess - `packages/bun-types/bun.d.ts` - TypeScript types - `docs/runtime/child-process.mdx` - Documentation - `test/js/bun/terminal/terminal.test.ts` - Comprehensive tests 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com>	2025-12-15 12:51:13 -08:00
robobun	8698d25c52	fix: ensure TLS handshake callback fires before HTTP request handler (#25525 ) ## Summary Fixes a flaky test (`test-http-url.parse-https.request.js`) where `request.socket._secureEstablished` was intermittently `false` when the HTTP request handler was called on HTTPS servers. ## Root Cause The `isAuthorized` flag was stored in `HttpContextData::flags.isAuthorized`, which is shared across all sockets in the same context. This meant multiple concurrent TLS connections could overwrite each other's authorization state, and the value could be stale when read. ## Fix Moved the `isAuthorized` flag from the context-level `HttpContextData` to the per-socket `AsyncSocketData` base class. This ensures each socket has its own authorization state that is set correctly during its TLS handshake callback. ## Changes - `AsyncSocketData.h`: Added per-socket `bool isAuthorized` field - `HttpContext.h`: Updated handshake callback to set per-socket flag instead of context-level flag - `JSNodeHTTPServerSocket.cpp`: Updated `isAuthorized()` to read from per-socket `AsyncSocketData` (via `HttpResponseData` which inherits from it) ## Testing Ran the flaky test 50+ times with 100% pass rate. Also verified gRPC and HTTP2 tests still pass. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com>	2025-12-15 12:44:26 -08:00
robobun	7dcd49f832	fix(install): only apply default trusted dependencies to npm packages (#25163 ) ## Summary - The default trusted dependencies list should only apply to packages installed from npm - Non-npm sources (file:, link:, git:, github:) now require explicit trustedDependencies - This prevents malicious packages from spoofing trusted names through local paths or git repos ## Test plan - [x] Added test: file: dependency named "esbuild" does NOT auto-run postinstall scripts - [x] Added test: file: dependency runs scripts when explicitly added to trustedDependencies - [x] Verified tests fail with system bun (old behavior) and pass with new build - [x] Build compiles successfully 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com> Co-authored-by: Dylan Conway <dylan.conway567@gmail.com>	2025-12-11 17:44:41 -08:00
robobun	c59a6997cd	feat(bundler): add statically-analyzable dead-code elimination via feature flags (#25462 ) ## Summary - Adds `import { feature } from "bun:bundle"` for compile-time feature flag checking - `feature("FLAG_NAME")` calls are replaced with `true`/`false` at bundle time - Enables dead-code elimination through `--feature=FLAG_NAME` CLI argument - Works in `bun build`, `bun run`, and `bun test` - Available in both CLI and `Bun.build()` JavaScript API ## Usage ```ts import { feature } from "bun:bundle"; if (feature("SUPER_SECRET")) { console.log("Secret feature enabled!"); } else { console.log("Normal mode"); } ``` ### CLI ```bash # Enable feature during build bun build --feature=SUPER_SECRET index.ts # Enable at runtime bun run --feature=SUPER_SECRET index.ts # Enable in tests bun test --feature=SUPER_SECRET ``` ### JavaScript API ```ts await Bun.build({ entrypoints: ['./index.ts'], outdir: './out', features: ['SUPER_SECRET', 'ANOTHER_FLAG'], }); ``` ## Implementation - Added `bundler_feature_flags` (as `*const bun.StringSet`) to `RuntimeFeatures` and `BundleOptions` - Added `bundler_feature_flag_ref` to Parser struct to track the `feature` import - Handle `bun:bundle` import at parse time (similar to macros) - capture ref, return empty statement - Handle `feature()` calls in `e_call` visitor - replace with boolean based on flags - Wire feature flags through CLI arguments and `Bun.build()` API to bundler options - Added `features` option to `JSBundler.zig` for JavaScript API support - Added TypeScript types in `bun.d.ts` - Added documentation to `docs/bundler/index.mdx` ## Test plan - [x] Basic feature flag enabled/disabled tests (both CLI and API backends) - [x] Multiple feature flags test - [x] Dead code elimination verification tests - [x] Error handling for invalid arguments - [x] Runtime tests with `bun run --feature=FLAG` - [x] Test runner tests with `bun test --feature=FLAG` - [x] Aliased import tests (`import { feature as checkFeature }`) - [x] 
Ternary operator DCE tests - [x] Tests use `itBundled` with both `backend: "cli"` and `backend: "api"` 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Alistair Smith <hi@alistair.sh> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com>	2025-12-11 17:44:14 -08:00
Jarred Sumner	98cee5a57e	Improve Bun.stringWidth accuracy and robustness (#25447 ) This PR significantly improves `Bun.stringWidth` to handle a wider variety of Unicode characters and escape sequences correctly. ## Zero-width character handling Added support for many previously unhandled zero-width characters: - Soft hyphen (U+00AD) - Word joiner and invisible operators (U+2060-U+2064) - Lone surrogates (U+D800-U+DFFF) - Arabic formatting characters (U+0600-U+0605, U+06DD, U+070F, U+08E2) - Indic script combining marks (Devanagari through Malayalam) - Thai and Lao combining marks - Combining Diacritical Marks Extended and Supplement - Tag characters (U+E0000-U+E007F) ## ANSI escape sequence handling ### CSI sequences - Now properly handles ALL CSI final bytes (0x40-0x7E), not just `m` - This means cursor movement (A/B/C/D), erase (J/K), scroll (S/T), and other CSI commands are now correctly excluded from width calculation ### OSC sequences - Added support for OSC sequences (ESC ] ... BEL/ST) - OSC 8 hyperlinks are now properly handled - Supports both BEL (0x07) and ST (ESC \) terminators ### ESC ESC fix - Fixed state machine bug where `ESC ESC` would incorrectly reset state - Now correctly handles consecutive ESC characters ## Emoji handling Added proper grapheme-aware emoji width calculation: - Flag emoji (regional indicator pairs) → width 2 - Skin tone modifiers → width 2 - ZWJ sequences (family, professions, etc.) 
→ width 2 - Keycap sequences → width 2 - Variation selectors (VS15 for text, VS16 for emoji presentation) - Uses ICU's `UCHAR_EMOJI` property for accurate emoji detection ## Test coverage Added comprehensive test suite with 94 tests covering: - All zero-width character categories - All CSI final bytes - OSC sequences with various terminators - Emoji edge cases (flags, skin tones, ZWJ, keycaps, variation selectors) - East Asian width (CJK, fullwidth, halfwidth katakana) - Indic and Thai script combining marks - Fuzzer-like stress tests for robustness ## Breaking changes This is a behavior change - `stringWidth` will return different values for some inputs. However, the new values are more accurate representations of terminal display width:

| Input | Old | New | Why |
|-------|-----|-----|-----|
| Flag emoji 🇺🇸 | 1 | 2 | Flags display as 2 cells |
| Skin tone 👋🏽 | 4 | 2 | Emoji + modifier = 1 grapheme |
| ZWJ family 👨‍👩‍👧 | 8 | 2 | ZWJ sequence = 1 grapheme |
| Word joiner U+2060 | 1 | 0 | Invisible character |
| OSC 8 hyperlinks | counted URL | just visible text | URLs are invisible |
| Cursor movement ESC[5A | counted | 0 | Control sequence |

🤖 Generated with [Claude Code](https://claude.ai/code) --------- Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Claude Bot <claude-bot@bun.sh>	2025-12-10 16:17:57 -08:00
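The escape-sequence rules in this commit (any CSI final byte in 0x40-0x7E ends a CSI sequence; OSC sequences end at BEL or at ESC \) can be sketched as a small scanner. This TypeScript sketch only strips those sequences and returns the remaining text; it does not compute display width, handles no emoji or zero-width rules, and is not Bun's implementation. The name `stripEscapeSequences` is hypothetical.

```typescript
// Hypothetical sketch: remove CSI and OSC sequences so that only the
// visible text remains. Width calculation itself is out of scope.
function stripEscapeSequences(input: string): string {
  let out = "";
  let i = 0;
  while (i < input.length) {
    if (input.charCodeAt(i) !== 0x1b) {
      out += input.charAt(i);
      i++;
      continue;
    }
    const next = input.charCodeAt(i + 1); // NaN when ESC is the last character
    if (next === 0x5b) {
      // CSI: ESC [ ... final byte in 0x40-0x7E
      i += 2;
      while (i < input.length) {
        const c = input.charCodeAt(i++);
        if (c >= 0x40 && c <= 0x7e) break;
      }
    } else if (next === 0x5d) {
      // OSC: ESC ] ... terminated by BEL (0x07) or ST (ESC \)
      i += 2;
      while (i < input.length) {
        const c = input.charCodeAt(i);
        if (c === 0x07) { i += 1; break; }
        if (c === 0x1b && input.charCodeAt(i + 1) === 0x5c) { i += 2; break; }
        i++;
      }
    } else {
      // Simplification: for any other ESC, drop only the ESC character.
      i++;
    }
  }
  return out;
}
```

For instance, an OSC 8 hyperlink such as `"\x1b]8;;https://example.com\x07link\x1b]8;;\x07"` reduces to `"link"`, so only the visible label would contribute to the width.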
robobun	a2d8b75962	fix(yaml): quote strings ending with colons (#25443 ) ## Summary - Fixes strings ending with colons (e.g., `"tin:"`) not being quoted in YAML.stringify output - This caused YAML.parse to fail with "Unexpected token" when parsing the output back ## Test plan - Added regression tests in `test/regression/issue/25439.test.ts` - Verified round-trip works for various strings ending with colons - Ran existing YAML tests to ensure no regressions Fixes #25439 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com>	2025-12-09 18:20:26 -08:00


9100 Commits