bun.sh

mirror of https://github.com/oven-sh/bun synced 2026-02-11 19:38:58 +00:00

Author	SHA1	Message	Date
Jarred Sumner	22315000e0	Update CLAUDE.md	2026-01-07 12:33:21 -08:00
Dylan Conway	4c492c66b8	fix(bundler): fix --compile with 8+ embedded files (#25859 ) ## Summary Fixes #20821 When `bun build --compile` was used with 8 or more embedded files, the compiled binary would silently fail to execute any code (exit code 0, no output). Root cause: Chunks were sorted alphabetically by their `entry_bits` key bytes. For entry point 0, the key starts with bit 0 set (byte pattern `0x01`), but for entry point 8, the key has bit 8 set in the byte (pattern `0x00, 0x01`). Alphabetically, `0x00 < 0x01`, so entry point 8's chunk sorted before entry point 0. This caused the wrong entry point to be identified as the main entry, resulting in asset wrapper code being executed instead of the user's code. Fix: Custom sort that ensures `entry_point_id=0` (the main entry point) always sorts first, with remaining chunks sorted alphabetically for determinism. ## Test plan - Added regression test `compile/ManyEmbeddedFiles` that embeds 8 files and verifies the main entry point runs correctly - Verified manually with reproduction case from issue 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-01-06 23:05:01 +00:00
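The ordering problem described in this commit can be reproduced outside the bundler. The following is a minimal sketch, assuming each chunk key is a little-endian bitset with one bit per entry point; `entryBitsKey`, `sortByKeyOnly`, and `sortMainFirst` are hypothetical names for illustration, not Bun's actual functions.

```typescript
// Hypothetical model of a chunk: its entry point id plus a bitset key.
interface Chunk {
  entryPointId: number;
  key: Uint8Array;
}

// Assumed key layout: bit N of a little-endian bitset is set for entry point N.
function entryBitsKey(entryPointId: number, totalEntryPoints: number): Uint8Array {
  const key = new Uint8Array(Math.ceil(totalEntryPoints / 8));
  key[entryPointId >> 3] |= 1 << (entryPointId & 7);
  return key;
}

// Plain byte-wise ("alphabetical") comparison of two keys.
function compareBytes(a: Uint8Array, b: Uint8Array): number {
  const n = Math.min(a.length, b.length);
  for (let i = 0; i < n; i++) {
    if (a[i] !== b[i]) return a[i] - b[i];
  }
  return a.length - b.length;
}

function makeChunks(count: number): Chunk[] {
  const chunks: Chunk[] = [];
  for (let id = 0; id < count; id++) {
    chunks.push({ entryPointId: id, key: entryBitsKey(id, count) });
  }
  return chunks;
}

// Ordering as described for the bug: sort by key bytes only.
function sortByKeyOnly(chunks: Chunk[]): Chunk[] {
  return [...chunks].sort((a, b) => compareBytes(a.key, b.key));
}

// Ordering as described for the fix: entry point 0 first, then key bytes.
function sortMainFirst(chunks: Chunk[]): Chunk[] {
  return [...chunks].sort((a, b) => {
    if (a.entryPointId === 0) return b.entryPointId === 0 ? 0 : -1;
    if (b.entryPointId === 0) return 1;
    return compareBytes(a.key, b.key);
  });
}
```

With nine entry points (the main entry plus eight embedded files) the keys grow to two bytes, so in this model `sortByKeyOnly` places entry point 8 ahead of entry point 0, while `sortMainFirst` keeps the main entry first.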
Jack Kleeman	46801ec926	Send http2 window resize frames after half the window is used up (#25847 ) ### What does this PR do? It is standard practice to send HTTP2 window resize frames once half the window is used up: - [nghttp2](https://en.wikipedia.org/wiki/Nghttp2#HTTP/2_implementation) - [Go](https://github.com/golang/net/blob/master/http2/flow.go#L31) - [Rust h2](https://github.com/hyperium/h2/blob/master/src/proto/streams/flow_control.rs#L17) The current behaviour of Bun is to send a window update once the client has sent 65535 bytes exactly. This leads to an interruption in throughput while the window update is received. This is not just a performance concern, however; I think some clients will not handle this well, as it means that if you stop sending data even 1 byte before 65535, you have a deadlock. The reason I came across this issue is that it appears that the Rust `hyper` crate always reserves an additional 1 byte from the connection for each http2 stream (https://github.com/hyperium/hyper/blob/master/src/proto/h2/mod.rs#L122). This means that when you have two concurrent requests, the client treats it like the window is exhausted when it actually has a byte remaining, leading to a sequential dependency between the requests that can create deadlocks if they depend on running concurrently. I accept this is not a problem with Bun, but it's a happy accident that we can resolve such off-by-one issues by increasing the window size once it is 50% utilized. ### How did you verify your code works? Using Wireshark, Bun debug logging, and client logging I can observe that the window updates are now sent after 32767 bytes. This also resolves the h2 crate client issue.	2026-01-06 11:37:50 -08:00
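The difference between the two update policies can be sketched with a small receive-window counter. This is an illustration only, assuming a single window of 65535 bytes; the `ReceiveWindow` class and its `onData` method are hypothetical and do not correspond to Bun's HTTP/2 code.

```typescript
const INITIAL_WINDOW = 65535;

// Hypothetical receiver-side counter: tracks bytes received since the
// last WINDOW_UPDATE and decides when the next one is due.
class ReceiveWindow {
  private unacked = 0;
  private readonly threshold: number;

  constructor(threshold: number) {
    this.threshold = threshold;
  }

  // Returns the increment to advertise in a WINDOW_UPDATE frame,
  // or 0 when no update is due yet.
  onData(bytes: number): number {
    this.unacked += bytes;
    if (this.unacked < this.threshold) return 0;
    const increment = this.unacked;
    this.unacked = 0;
    return increment;
  }
}

// Policy described as the old behaviour: update only once the full window is used.
const fullWindowPolicy = new ReceiveWindow(INITIAL_WINDOW);
// Policy described in this PR: update once half the window is used.
const halfWindowPolicy = new ReceiveWindow(Math.floor(INITIAL_WINDOW / 2));

// A peer that holds back one byte (as described for hyper) sends 65534 bytes and waits.
const updateUnderFullPolicy = fullWindowPolicy.onData(INITIAL_WINDOW - 1);
const updateUnderHalfPolicy = halfWindowPolicy.onData(INITIAL_WINDOW - 1);
```

In this model `updateUnderFullPolicy` is 0, so a peer that holds back its last byte never receives more credit, whereas `updateUnderHalfPolicy` returns the consumed bytes as new credit.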
Jarred Sumner	c6a73fc23e	Hardcode __NR_close_range (#25840 ) ### What does this PR do? Fixes https://github.com/oven-sh/bun/issues/25839 ### How did you verify your code works?	2026-01-06 15:07:19 +00:00
Andrew Johnston	3de2dc1287	fix: use the computed size of the `Offsets` struct instead of hard-co… (#25816 ) ### What does this PR do? - I was trying to understand how the SEA bundling worked in Bun and noticed the size of the `Offsets` struct is hard-coded here to 32. It should use the computed size to be future proof to changes in the schema. ### How did you verify your code works? - I didn't. Can add tests if this is not covered by existing tests. ChatGPT agreed with me though. =)	2026-01-06 15:04:28 +00:00
SUZUKI Sosuke	370e6fb9fa	fix(fetch): fix ReadableStream memory leak when using stream body (#25846 ) ## Summary This PR fixes a memory leak that occurs when `fetch()` is called with a `ReadableStream` body. The ReadableStream objects were not being properly released, causing them to accumulate in memory. ## Problem When using `fetch()` with a ReadableStream body: ```javascript const stream = new ReadableStream({ start(controller) { controller.enqueue(new TextEncoder().encode("data")); controller.close(); } }); await fetch(url, { method: "POST", body: stream }); ``` The ReadableStream objects leak because `FetchTasklet.clearData()` has a conditional that prevents `detach()` from being called on ReadableStream request bodies after streaming has started. ### Root Cause The problematic condition in `clearData()`: ```zig if (this.request_body != .ReadableStream or this.is_waiting_request_stream_start) { this.request_body.detach(); } ``` After `startRequestStream()` is called: - `is_waiting_request_stream_start` becomes `false` - `request_body` is still `.ReadableStream` - The condition evaluates to `(false or false) = false` - `detach()` is skipped → memory leak ### Why the Original Code Was Wrong The original code appears to assume that when `startRequestStream()` is called, ownership of the Strong reference is transferred to `ResumableSink`. However, this is incorrect: 1. `startRequestStream()` creates a new independent Strong reference in `ResumableSink` (see `ResumableSink.zig:119`) 2. The FetchTasklet's original reference is not transferred - it becomes redundant 3. Strong references in Bun are independent - calling `deinit()` on one does not affect the other ## Solution Remove the conditional and always call `detach()`: ```zig // Always detach request_body regardless of type. 
// When request_body is a ReadableStream, startRequestStream() creates // an independent Strong reference in ResumableSink, so FetchTasklet's // reference becomes redundant and must be released to avoid leaks. this.request_body.detach(); ``` ### Safety Analysis This change is safe because: 1. Strong references are independent: Each Strong reference maintains its own ref count. Detaching FetchTasklet's reference doesn't affect ResumableSink's reference 2. Idempotency: `detach()` is safe to call on already-detached references 3. Timing: `clearData()` is only called from `deinit()` after streaming has completed (ref_count = 0) 4. No UAF risk: `deinit()` only runs when ref_count reaches 0, which means all streaming operations have completed ## Test Results Before fix (with system Bun): ``` Expected: <= 100 Received: 501 (Request objects leaked) Received: 1002 (ReadableStream objects leaked) ``` After fix: ``` 6 pass 0 fail ``` ## Test Coverage Added comprehensive tests in `test/js/web/fetch/fetch-cyclic-reference.test.ts` covering: - Response stream leaks with cyclic references - Streaming response body leaks - Request body stream leaks with cyclic references - ReadableStream body leaks (no cyclic reference needed to reproduce) - Concurrent fetch operations with cyclic references --------- Co-authored-by: Jarred Sumner <jarred@jarredsumner.com> Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com>	2026-01-06 15:00:52 +00:00
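The leaking case can be isolated by enumerating the old condition. This is a sketch that mirrors the boolean expression quoted in the commit message; `RequestBodyKind` and `detachedBeforeFix` are hypothetical names used only for illustration.

```typescript
type RequestBodyKind = "ReadableStream" | "Other";

// Mirrors the pre-fix condition quoted above:
// detach only if the body is not a ReadableStream, or streaming has not started yet.
function detachedBeforeFix(body: RequestBodyKind, isWaitingRequestStreamStart: boolean): boolean {
  return body !== "ReadableStream" || isWaitingRequestStreamStart;
}

// Collect every combination in which detach() would have been skipped.
const bodies: RequestBodyKind[] = ["ReadableStream", "Other"];
const leakedBeforeFix: string[] = [];
for (const body of bodies) {
  for (const waiting of [true, false]) {
    if (!detachedBeforeFix(body, waiting)) {
      leakedBeforeFix.push(`${body}/waiting=${waiting}`);
    }
  }
}
```

Only one combination ends up in `leakedBeforeFix`: a `ReadableStream` body whose request stream has already started, which is the case the commit describes.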
Prithvish Baidya	9ab6365a13	Add support for Requester Pays in S3 operations (#25514 ) - Introduced `requestPayer` option in S3-related functions and structures to handle Requester Pays buckets. - Updated S3 client methods to accept and propagate the `requestPayer` flag. - Enhanced documentation for the `requestPayer` option in the S3 type definitions. - Adjusted existing S3 operations to utilize the `requestPayer` parameter where applicable, ensuring compatibility with AWS S3's Requester Pays feature. - Ensured that the new functionality is integrated into multipart uploads and simple requests. ### What does this PR do? This change allows users to specify whether they are willing to pay for data transfer costs when accessing objects in Requester Pays buckets, improving flexibility and compliance with AWS S3's billing model. This closes #25499 ### How did you verify your code works? I have added a new test file to verify this functionality, and all my tests pass. I also tested this against an actual S3 bucket which can only be accessed if requester pays. I can confirm that it's accessible when `requestPayer` is `true`, and the default of `false` does not allow access. An example bucket is here: s3://hl-mainnet-evm-blocks/0/0/1.rmp.lz4 (my use case is indexing [hyperliquid block data](https://hyperliquid.gitbook.io/hyperliquid-docs/for-developers/hyperevm/raw-hyperevm-block-data) which is stored in S3, and I want to use Bun to index faster) --------- Co-authored-by: Alistair Smith <hi@alistair.sh> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Ciro Spaciari <ciro.spaciari@gmail.com>	2026-01-05 15:04:20 -08:00
Alistair Smith	bf937f7294	sql: filter out undefined values in INSERT helper instead of treating as NULL (#25830 ) ### What does this PR do? the `sql()` helper now filters out `undefined` values in INSERT statements instead of converting them to `NULL`. This allows columns with `DEFAULT` values to use their defaults when `undefined` is passed, rather than being overridden with `NULL`. Before: `sql({ foo: undefined, id: "123" })` in INSERT would generate `(foo, id) VALUES (NULL, "123")`, causing NOT NULL constraint violations even when the column has a DEFAULT. After: `sql({ foo: undefined, id: "123" })` in INSERT generates `(id) VALUES ("123")`, omitting the undefined column entirely and letting the database use the DEFAULT value. Also fixes a data loss bug in bulk inserts where columns were determined only from the first item - now all items are checked, so values in later items aren't silently dropped. Fixes #25829 ### How did you verify your code works? - Added regression test for #25829 (NOT NULL column with DEFAULT) - Added tests for bulk insert with mixed undefined patterns which is the data loss scenario	2026-01-05 15:03:20 -08:00
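The column-selection rule described here can be sketched independently of the SQL driver. The helper below is hypothetical (`insertColumns` is not part of Bun's API) and only covers which columns end up in the column list; how a missing cell in an individual row is rendered is not modelled.

```typescript
type Row = Record<string, unknown>;

// Collect the columns for an INSERT: every key that holds a defined value
// in at least one row, in first-seen order. Keys that are `undefined`
// in every row are omitted so the database can apply its DEFAULT.
function insertColumns(rows: Row[]): string[] {
  const columns: string[] = [];
  const seen = new Set<string>();
  for (const row of rows) {
    for (const name of Object.keys(row)) {
      if (row[name] !== undefined && !seen.has(name)) {
        seen.add(name);
        columns.push(name);
      }
    }
  }
  return columns;
}

// Single-row case from the description: `foo` is dropped, `id` is kept.
const singleRowColumns = insertColumns([{ foo: undefined, id: "123" }]);

// Bulk case: `b` appears only in the second item but must not be dropped.
const bulkColumns = insertColumns([{ a: 1 }, { a: 2, b: 3 }]);
```

Scanning all rows rather than only the first is what prevents the data loss described for bulk inserts: a column that first appears in a later item still makes it into the column list.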
robobun	4301af9f3e	Harden TLS hostname verification (#25727 ) ## Summary - Tighten wildcard certificate matching logic for improved security - Add tests for wildcard hostname verification edge cases ## Test plan - [x] `bun bd test test/js/web/fetch/fetch.tls.wildcard.test.ts` passes - [x] Existing TLS tests continue to pass 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com>	2026-01-05 10:21:49 -08:00
Darwin ❤️❤️❤️	27ff6aaae0	fix(web): make URLSearchParams.prototype.size configurable (#25762 )	2026-01-02 04:57:48 -08:00
robobun	779764332a	feat(cli): add `--grep` as alias for `-t`/`--test-name-pattern` in `bun test` (#25788 )	2026-01-02 04:52:47 -08:00
robobun	604c83c8a6	perf(ipc): fix O(n²) JSON scanning for large chunked messages (#25743 ) ## Summary - Fix O(n²) performance bug in JSON mode IPC when receiving large messages that arrive in chunks - Add `JsonIncomingBuffer` wrapper that tracks newline positions to avoid re-scanning - Each byte is now scanned exactly once (on arrival or when preceding message is consumed) ## Problem When data arrives in chunks in JSON mode, `decodeIPCMessage` was calling `indexOfChar(data, '\n')` on the ENTIRE accumulated buffer every time. For a 10MB message arriving in 160 chunks of 64KB: - Chunk 1: scan 64KB - Chunk 2: scan 128KB - Chunk 3: scan 192KB - ... - Chunk 160: scan 10MB Total: ~800MB scanned for one 10MB message. ## Solution Introduced a `JsonIncomingBuffer` struct that: 1. Tracks `newline_pos: ?u32` - position of known upcoming newline (if any) 2. On `append(bytes)`: Only scans new chunk for `\n` if no position is cached 3. On `consume(bytes)`: Updates or re-scans as needed after message processing This ensures O(n) scanning instead of O(n²). 
## Test plan - [x] `bun run zig:check-all` passes (all platforms compile) - [x] `bun bd test test/js/bun/spawn/spawn.ipc.test.ts` - 4 tests pass - [x] `bun bd test test/js/node/child_process/child_process_ipc.test.js` - 1 test pass - [x] `bun bd test test/js/bun/spawn/bun-ipc-inherit.test.ts` - 1 test pass - [x] `bun bd test test/js/bun/spawn/spawn.ipc.bun-node.test.ts` - 1 test pass - [x] `bun bd test test/js/bun/spawn/spawn.ipc.node-bun.test.ts` - 1 test pass - [x] `bun bd test test/js/node/child_process/child_process_ipc_large_disconnect.test.js` - 1 test pass - [x] Manual verification with `child-process-send-cb-more.js` (32KB messages) 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com>	2025-12-29 20:02:18 -08:00
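The newline-caching idea can be sketched as follows. This is a simplified string-based model, not the Zig `JsonIncomingBuffer`; the class name, the `next()` method, and the `bytesScanned` counter are assumptions added for illustration.

```typescript
// Hypothetical model of a newline-delimited message buffer that remembers
// where the next newline is, so accumulated data is never re-scanned.
class IncomingBufferSketch {
  private data = "";
  private newlinePos: number | null = null;
  // Instrumentation only: how many characters have been examined so far.
  bytesScanned = 0;

  append(chunk: string): void {
    if (this.newlinePos === null) {
      // Only the new chunk is scanned; earlier data is known to have no newline.
      const idx = chunk.indexOf("\n");
      this.bytesScanned += idx === -1 ? chunk.length : idx + 1;
      if (idx !== -1) this.newlinePos = this.data.length + idx;
    }
    this.data += chunk;
  }

  // Returns the next complete message, or null if none has arrived yet.
  next(): string | null {
    if (this.newlinePos === null) return null;
    const message = this.data.slice(0, this.newlinePos);
    this.data = this.data.slice(this.newlinePos + 1);
    // The remainder has not been scanned yet, so scan it once now.
    const idx = this.data.indexOf("\n");
    this.bytesScanned += idx === -1 ? this.data.length : idx + 1;
    this.newlinePos = idx === -1 ? null : idx;
    return message;
  }
}
```

For a message that arrives in ten 100-character chunks, `bytesScanned` ends at 1000 in this model, rather than the 5500 that re-scanning the whole accumulated buffer on every chunk would cost.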
robobun	370b25c086	perf(Buffer.indexOf): use SIMD-optimized search functions (#25745 ) ## Summary Optimize `Buffer.indexOf` and `Buffer.includes` by replacing `std::search` with Highway SIMD-optimized functions: - Single-byte search: `highway_index_of_char` - SIMD-accelerated character search - Multi-byte search: `highway_memmem` - SIMD-accelerated substring search These Highway functions are already used throughout Bun's codebase for fast string searching (URL parsing, JS lexer, etc.) and provide significant speedups for pattern matching in large buffers. ### Changes ```cpp // Before: std::search (scalar) auto it = std::search(haystack.begin(), haystack.end(), needle.begin(), needle.end()); // After: SIMD-optimized if (valueLength == 1) { size_t result = highway_index_of_char(haystackPtr, haystackLen, valuePtr[0]); } else { void* result = highway_memmem(haystackPtr, haystackLen, valuePtr, valueLength); } ``` ## Test plan - [x] Debug build compiles successfully - [x] Basic functionality verified (`indexOf`, `includes` with single and multi-byte patterns) - [ ] Existing Buffer tests pass 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com>	2025-12-29 17:27:12 -08:00
Tommy D. Rossi	538be1399c	feat(bundler): expose reactFastRefresh option in Bun.build API (#25731 ) Fixes #25716 Adds support for a `reactFastRefresh: boolean` option in the `Bun.build` JavaScript API, matching the existing `--react-fast-refresh` CLI flag. ```ts const result = await Bun.build({ reactFastRefresh: true, entrypoints: ["src/App.tsx"], }); ``` When enabled, the bundler adds React Fast Refresh transform code (`$RefreshReg$`, `$RefreshSig$`) to the output.	2025-12-28 22:07:47 -08:00
robobun	d04b86d34f	perf: use jsonStringifyFast for faster JSON serialization (#25733 ) ## Summary Apply the same optimization technique from PR #25717 (Response.json) to other APIs that use JSON.stringify internally: - IPC message serialization (`ipc.zig`) - used for inter-process communication - console.log with %j format (`ConsoleObject.zig`) - commonly used for debugging - PostgreSQL JSON/JSONB types (`PostgresRequest.zig`) - database operations - MySQL JSON type (`MySQLTypes.zig`) - database operations - Jest %j/%o format specifiers (`jest.zig`) - test output formatting - Transpiler tsconfig/macros (`JSTranspiler.zig`) - build configuration ### Root Cause When calling `JSONStringify(globalObject, value, 0)`, the space parameter `0` becomes `jsNumber(0)`, which is NOT `undefined`. This causes JSC's FastStringifier (SIMD-optimized) to bail out: ```cpp // In WebKit's JSONObject.cpp FastStringifier::stringify() if (!space.isUndefined()) { logOutcome("space"_s); return { }; // Bail out to slow path } ``` Using `jsonStringifyFast` which passes `jsUndefined()` triggers the fast path. ### Expected Performance Improvement Based on PR #25717 results, these changes should provide ~3x speedup for JSON serialization in the affected APIs. ## Test plan - [x] Debug build compiles successfully - [x] Basic functionality verified (IPC, console.log %j, Response.json) - [x] Existing tests pass 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com>	2025-12-28 18:01:07 -08:00
robobun	37fc8e99f7	Harden WebSocket client decompression (#25724 ) ## Summary - Add maximum decompressed message size limit to WebSocket client deflate handling - Add test coverage for decompression limits ## Test plan - Run `bun test test/js/web/websocket/websocket-permessage-deflate-edge-cases.test.ts` 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com>	2025-12-28 17:58:24 -08:00
robobun	6b5de25d8a	feat(shell): add $.trace for analyzing shell commands without execution (#25667 ) ## Summary Adds `Bun.$.trace` for tracing shell commands without executing them. ```js const result = $.trace`cat /tmp/file.txt > output.txt`; // { operations: [...], cwd: "...", success: true, error: null } ``` ## Test plan - [x] `bun bd test test/js/bun/shell/trace.test.ts` 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com>	2025-12-27 17:25:52 -08:00
Alex Miller	7b49654db6	fix(io): Prevent data corruption in `Bun.write` for files >2GB (#25720 ) Closes #8254 Fixes a data corruption bug in `Bun.write()` where files larger than 2GB would have chunks skipped resulting in corrupted output with missing data. The `doWriteLoop` had an issue where it would essentially end up offsetting twice every 2GB chunks: - it first sliced the buffer by `total_written`: ```remain = remain[@min(this.total_written, remain.len)..]``` - it would then increment `bytes_blob.offset`: `this.bytes_blob.offset += @truncate(wrote)` but because `sharedView()` already uses the blob offset `slice_ = slice_[this.offset..]` it would end up doubling the offset. In a local reproduction writing a 16GB file with each 2GB chunk filled with incrementing values `[1, 2, 3, 4, 5, 6, 7, 8]`, the buggy version produced: `[1, 3, 5, 7, …]`, skipping every other chunk. The fix is to simply remove the redundant manual offset and rely only on `total_written` to track write progress.	2025-12-27 16:58:36 -08:00
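The double-offset effect can be reproduced at small scale. The sketch below is a hypothetical model of the loop described above, with one array element standing in for each 2GB chunk; `writeLoop` and its `doubleOffset` flag are illustrative names, not Bun's code.

```typescript
// Simulates a write loop that can write at most `maxPerWrite` elements per call.
// With `doubleOffset` set, progress is applied twice: once through the blob
// offset used by the shared view, and once by slicing with `totalWritten`.
function writeLoop(data: number[], maxPerWrite: number, doubleOffset: boolean): number[] {
  const out: number[] = [];
  let totalWritten = 0;
  let blobOffset = 0;
  for (;;) {
    // Equivalent of sharedView(): already starts at the blob offset.
    const view = data.slice(blobOffset);
    // Equivalent of slicing the remaining buffer by total_written.
    const remain = view.slice(Math.min(totalWritten, view.length));
    if (remain.length === 0) break;
    const wrote = Math.min(maxPerWrite, remain.length);
    out.push(...remain.slice(0, wrote));
    totalWritten += wrote;
    if (doubleOffset) blobOffset += wrote; // the redundant manual offset
  }
  return out;
}

const chunks = [1, 2, 3, 4, 5, 6, 7, 8];
const buggyOutput = writeLoop(chunks, 1, true);
const fixedOutput = writeLoop(chunks, 1, false);
```

In this model `buggyOutput` is `[1, 3, 5, 7]`, the same every-other-chunk pattern reported in the reproduction, while `fixedOutput` contains all eight values.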
SUZUKI Sosuke	603bbd18a0	Enable `CHECK_REF_COUNTED_LIFECYCLE` in WebKit (#25705 ) ### What does this PR do? Enables `CHECK_REF_COUNTED_LIFECYCLE` in WebKit ( https://github.com/oven-sh/WebKit/pull/121 ) See also `a978fae619` #### `CHECK_REF_COUNTED_LIFECYCLE`? A compile-time macro that enables lifecycle validation for reference-counted objects in debug builds. Definition ```cpp #if ASSERT_ENABLED || ENABLE(SECURITY_ASSERTIONS) #define CHECK_REF_COUNTED_LIFECYCLE 1 #else #define CHECK_REF_COUNTED_LIFECYCLE 0 #endif ``` Purpose Detects three categories of bugs: 1. Missing adoption - Objects stored in RefPtr without using adoptRef() 2. Ref during destruction - ref() called while destructor is running (causes dangling pointers) 3. Thread safety violations - Unsafe ref/deref across threads Implementation When enabled, RefCountDebugger adds two tracking flags: - m_deletionHasBegun - Set when destructor starts - m_adoptionIsRequired - Cleared when adoptRef() is called These flags are checked on every ref()/deref() call, with assertions failing on violations. Motivation Refactored debug code into a separate RefCountDebugger class to: - Improve readability of core refcount logic - Eliminate duplicate code across RefCounted, ThreadSafeRefCounted, etc. - Simplify adding new refcount classes Overhead Zero in release builds - the flags and checks are compiled out entirely. ### How did you verify your code works? --------- Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-27 15:02:11 -08:00
robobun	1d7cb4bbad	perf(Response.json): use JSC's FastStringifier by passing undefined for space (#25717 ) ## Summary - Fix performance regression where `Response.json()` was 2-3x slower than `JSON.stringify() + new Response()` - Root cause: The existing code called `JSC::JSONStringify` with `indent=0`, which internally passes `jsNumber(0)` as the space parameter. This bypasses WebKit's FastStringifier optimization. - Fix: Add a new `jsonStringifyFast` binding that passes `jsUndefined()` for the space parameter, triggering JSC's FastStringifier (SIMD-optimized) code path. ## Root Cause Analysis In WebKit's `JSONObject.cpp`, the `stringify()` function has this logic: ```cpp static NEVER_INLINE String stringify(JSGlobalObject& globalObject, JSValue value, JSValue replacer, JSValue space) { // ... if (String result = FastStringifier<Latin1Character, BufferMode::StaticBuffer>::stringify(globalObject, value, replacer, space, failureReason); !result.isNull()) return result; // Falls back to slow Stringifier... } ``` And `FastStringifier::stringify()` checks: ```cpp if (!space.isUndefined()) { logOutcome("space"_s); return { }; // Bail out to slow path } ``` So when we called `JSONStringify(globalObject, value, (unsigned)0)`, it converted to `jsNumber(0)` which is NOT `undefined`, causing FastStringifier to bail out. 
## Performance Results ### Before (3.5x slower than manual approach) ``` Response.json(): 2415ms JSON.stringify() + Response(): 689ms Ratio: 3.50x ``` ### After (parity with manual approach) ``` Response.json(): ~700ms JSON.stringify() + Response(): ~700ms Ratio: ~1.09x ``` ## Test plan - [x] Existing `Response.json()` tests pass (`test/regression/issue/21257.test.ts`) - [x] Response tests pass (`test/js/web/fetch/response.test.ts`) - [x] Manual verification that output is correct for various JSON inputs Fixes #25693 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Sosuke Suzuki <sosuke@bun.com>	2025-12-27 15:01:28 -08:00
Oleksandr Herasymov	d3a5f2eef2	perf: speed up Bun.hash.crc32 by switching to zlib CRC32 (#25692 ) ## What does this PR do? Switch `Bun.hash.crc32` to use `zlib`'s CRC32 implementation. Bun already links `zlib`, which provides highly optimized, hardware-accelerated CRC32. Because `zlib.crc32` takes a 32-bit length, chunk large inputs to avoid truncation/regressions on buffers >4GB. Note: This was tried before (PR #12164 by Jarred), which switched CRC32 to zlib for speed. This proposal keeps that approach and adds explicit chunking to avoid the >4GB length pitfall. Problem: `Bun.hash.crc32` is a significant outlier in microbenchmarks compared to other hash functions (about 21x slower than `zlib.crc32` in a 1MB test on M1). Root cause: `Bun.hash.crc32` uses Zig's `std.hash.Crc32` implementation, which is software-only and does not leverage hardware acceleration (e.g., `PCLMULQDQ` on x86 or `CRC32` instructions on ARM). Implementation: https://github.com/oven-sh/bun/blob/main/src/bun.js/api/HashObject.zig ```zig pub const crc32 = hashWrap(struct { pub fn hash(seed: u32, bytes: []const u8) u32 { // zlib takes a 32-bit length, so chunk large inputs to avoid truncation. var crc: u64 = seed; var offset: usize = 0; while (offset < bytes.len) { const remaining = bytes.len - offset; const max_len: usize = std.math.maxInt(u32); const chunk_len: u32 = if (remaining > max_len) @intCast(max_len) else @intCast(remaining); crc = bun.zlib.crc32(crc, bytes.ptr + offset, chunk_len); offset += chunk_len; } return @intCast(crc); } }); ``` ### How did you verify your code works? Benchmark (1MB payload): - Before: Bun 1.3.5 `Bun.hash.crc32` = 2,644,444 ns/op vs `zlib.crc32` = 124,324 ns/op (~21x slower) - After (local bun-debug): `Bun.hash.crc32` = 360,591 ns/op vs `zlib.crc32` = 359,069 ns/op (~1.0x), results match ## Test environment - Machine: MacBook Pro 13" (M1, 2020) - OS: macOS 15.7.3 - Baseline Bun: 1.3.5 - After Bun: local `bun-debug` (build/debug)	2025-12-26 23:41:10 -08:00
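The chunking step relies on CRC32 being resumable: feeding the running checksum back in as the seed for the next chunk gives the same result as a single pass. The sketch below demonstrates that property with a plain table-driven CRC32 written for this example; it does not call zlib, and `crc32Update` and `crc32Chunked` are hypothetical names.

```typescript
// Standard reflected CRC-32 lookup table (polynomial 0xEDB88320).
const CRC_TABLE = new Uint32Array(256);
for (let n = 0; n < 256; n++) {
  let c = n;
  for (let k = 0; k < 8; k++) {
    c = c & 1 ? 0xedb88320 ^ (c >>> 1) : c >>> 1;
  }
  CRC_TABLE[n] = c >>> 0;
}

// Continues a CRC-32 from a previous value, in the style of zlib's crc32(crc, buf, len).
function crc32Update(crc: number, bytes: Uint8Array): number {
  let c = (crc ^ 0xffffffff) >>> 0;
  for (let i = 0; i < bytes.length; i++) {
    c = CRC_TABLE[(c ^ bytes[i]) & 0xff] ^ (c >>> 8);
  }
  return (c ^ 0xffffffff) >>> 0;
}

// Processes the input in pieces of at most `maxChunk` bytes, carrying the
// running CRC forward. `maxChunk` must be a positive integer.
function crc32Chunked(bytes: Uint8Array, maxChunk: number, seed: number = 0): number {
  let crc = seed;
  for (let offset = 0; offset < bytes.length; offset += maxChunk) {
    const end = Math.min(offset + maxChunk, bytes.length);
    crc = crc32Update(crc, bytes.subarray(offset, end));
  }
  return crc;
}

// ASCII bytes of "123456789", the conventional CRC check input.
const checkInput = Uint8Array.from([49, 50, 51, 52, 53, 54, 55, 56, 57]);
const oneShot = crc32Update(0, checkInput);
const chunked = crc32Chunked(checkInput, 4);
```

Both `oneShot` and `chunked` evaluate to `0xCBF43926`, the well-known CRC-32 check value for that input, which is why splitting a large buffer at the 32-bit length boundary does not change the result.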
robobun	b51e993bc2	fix: reject null bytes in spawn args, env, and shell arguments (#25698 ) ## Summary - Reject null bytes in command-line arguments passed to `Bun.spawn` and `Bun.spawnSync` - Reject null bytes in environment variable keys and values - Reject null bytes in shell (`$`) template literal arguments This prevents null byte injection attacks (CWE-158) where null bytes in strings could cause unintended truncation when passed to the OS, potentially allowing attackers to bypass file extension validation or create files with unexpected names. ## Test plan - [x] Added tests in `test/js/bun/spawn/null-byte-injection.test.ts` - [x] Tests pass with debug build: `bun bd test test/js/bun/spawn/null-byte-injection.test.ts` - [x] Tests fail with system Bun (confirming the fix works) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com>	2025-12-26 23:39:37 -08:00
Dylan Conway	d0bd1b121f	Fix DCE producing invalid syntax for empty objects in spreads (#25710 ) ## Summary - Fixes dead code elimination producing invalid syntax like `{ ...a, x: }` when simplifying empty objects in spread contexts - The issue was that `simplifyUnusedExpr` and `joinAllWithCommaCallback` could return `E.Missing` instead of `null` to indicate "no side effects" - Added checks to return `null` when the result is `E.Missing` Fixes #25609 ## Test plan - [x] Added regression test that fails on v1.3.5 and passes with fix - [x] `bun bd test test/regression/issue/25609.test.ts` passes 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude <noreply@anthropic.com>	2025-12-26 21:48:19 -08:00
Nico Cevallos	5715b54614	add test for dependency order when a package's name is larger than 8 characters + fix (#25697 ) ### What does this PR do? - Add a test that fails before the code changes, and fix the previous test so that the script in the dependency takes some time to execute. Without the `setTimeout` in the tests, a race condition means they always succeed. I tried adding a test combining both tests, with dependencies `dep0` and `larger-than-8-char`, but if the timeout is the same it succeeds. - Fix the added use case by using the correct buffer for `Dependency.name`; otherwise it reads garbage when the package name is longer than 8 characters. This should fix #12203 ### How did you verify your code works? Undo the code changes to verify the new test fails, then check that it passes again after reapplying them.	2025-12-25 23:49:23 -08:00
SUZUKI Sosuke	699d8b1e1c	Upgrade WebKit Dec 24, 2025 (#25684 ) - WTFMove → WTF::move / std::move: Replaced WTFMove() macro with WTF::move() function for WTF types, std::move() for std types - SortedArrayMap removed: Replaced with if-else chains in EventFactory.cpp, JSCryptoKeyUsage.cpp - Wasm::Memory::create signature changed: Removed VM parameter - URLPattern allocation: Changed from WTF_MAKE_ISO_ALLOCATED to WTF_MAKE_TZONE_ALLOCATED --------- Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>	2025-12-25 14:00:58 -08:00
SUZUKI Sosuke	bffccf3d5f	Upgrade WebKit 2025/12/07 (#25429 ) Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com> Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com> Co-authored-by: Claude Bot <claude-bot@bun.sh>	2025-12-23 22:24:18 -08:00
robobun	8484e1b827	perf: shrink ConcurrentTask from 24 bytes to 16 bytes (#25636 )	2025-12-22 12:07:24 -08:00
robobun	3898ed5e3f	perf: pack boolean flags and reorder fields to reduce struct padding (#25627 )	2025-12-21 17:12:42 -08:00
Jarred Sumner	c08ffadf56	perf(linux): add memfd optimizations and typed flags (#25597 ) ## Summary - Add `MemfdFlags` enum to replace raw integer flags for `memfd_create`, providing semantic clarity for different use cases (`executable`, `non_executable`, `cross_process`) - Add support for `MFD_EXEC` and `MFD_NOEXEC_SEAL` flags (Linux 6.3+) with automatic fallback to older kernel flags when `EINVAL` is returned - Use memfd + `/proc/self/fd/{fd}` path for loading embedded `.node` files in standalone builds, avoiding disk writes entirely on Linux ## Test plan - [ ] Verify standalone builds with embedded `.node` files work on Linux - [ ] Verify fallback works on older kernels (pre-6.3) - [ ] Verify subprocess stdio memfd still works correctly 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-12-19 23:18:21 -08:00
Dylan Conway	fa983247b2	fix(create): crash when running postinstall task with --no-install (#25616 ) ## Summary - Fix segmentation fault in `bun create` when using `--no-install` with a template that has a `bun-create.postinstall` task starting with "bun " - The bug was caused by unconditionally slicing `argv[2..]` which created an empty array when `npm_client` was null - Added check for `npm_client != null` before slicing ## Reproduction ```bash # Create template with bun-create.postinstall mkdir -p ~/.bun-create/test-template echo '{"name":"test","bun-create":{"postinstall":"bun install"}}' > ~/.bun-create/test-template/package.json # This would crash before the fix bun create test-template /tmp/my-app --no-install ``` ## Test plan - [x] Verified the reproduction case crashes before the fix - [x] Verified the reproduction case works after the fix 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-19 23:17:51 -08:00
Dylan Conway	99b0a16c33	fix: prevent out-of-bounds access in NO_PROXY parsing (#25617 ) ## Summary - Fix out-of-bounds access when parsing `NO_PROXY` environment variable with empty entries - Empty entries (e.g., `"localhost, , example.com"`) would cause a panic when checking if the host starts with a dot - Skip empty entries after trimming whitespace fixes BUN-110G fixes BUN-128V ## Test plan - [x] Verify `NO_PROXY="localhost, , example.com"` no longer crashes 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-12-19 23:17:29 -08:00
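The skip-empty-entries rule can be sketched as below. The parsing step follows the description above; the suffix-matching rule in `isProxyBypassed` is an assumption based on common `NO_PROXY` conventions, and both function names are hypothetical rather than Bun's.

```typescript
// Split a NO_PROXY value on commas, trim whitespace, and drop empty entries
// so later checks never index into an empty string.
function parseNoProxy(value: string): string[] {
  return value
    .split(",")
    .map((entry) => entry.trim())
    .filter((entry) => entry.length > 0);
}

// Assumed matching rule: an entry matches the host itself or any subdomain,
// with or without a leading dot on the entry.
function isProxyBypassed(host: string, noProxy: string): boolean {
  for (const entry of parseNoProxy(noProxy)) {
    const bare = entry.startsWith(".") ? entry.slice(1) : entry;
    if (host === bare || host.endsWith("." + bare)) return true;
  }
  return false;
}

const parsedEntries = parseNoProxy("localhost, , example.com");
```

Here `parsedEntries` contains only `localhost` and `example.com`; the blank entry between the commas is removed before any per-entry check runs.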
Dylan Conway	085e25d5d1	fix: protect StringOrBuffer from GC in async operations (#25594 ) ## Summary - Fix use-after-free crash in async zstd compression, scrypt, and JSTranspiler operations - When `StringOrBuffer.fromJSMaybeAsync` is called with `is_async=true`, the buffer's JSValue is now protected from garbage collection - Previously, the buffer could be GC'd while a worker thread was still accessing it, causing segfaults in zstd's `HIST_count_simple` and similar functions Fixes BUN-167Z ## Changes - `fromJSMaybeAsync`: Call `protect()` on buffer when `is_async=true` - `fromJSWithEncodingMaybeAsync`: Same protection for the early return path - `Scrypt`: Fix cleanup to use `deinitAndUnprotect()` for async path, add missing `deinit()` in sync path - `JSTranspiler`: Use new protection mechanism instead of manual `protect()`/`unprotect()` calls - Simplify `createOnJSThread` signatures to not return errors (OOM is handled internally) - Update all callers to use renamed/simplified APIs ## Test plan - [x] Code review of all callsites to verify correct protect/unprotect pairing - [ ] Run existing zstd tests - [ ] Run existing scrypt tests - [ ] Run existing transpiler tests 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-19 17:30:26 -08:00
Jarred Sumner	ce5c336ea5	Revert "fix: memory leaks in IPC message handling (#25602 )" This reverts commit `05b12e0ed0`. The tests did not fail with system version of Bun.	2025-12-19 17:28:54 -08:00
robobun	05b12e0ed0	fix: memory leaks in IPC message handling (#25602 ) ## Summary - Add periodic memory reclamation for IPC buffers after processing messages - Fix missing `deref()` on `bun.String` created from `cmd` property in `handleIPCMessage` - Add `reclaimMemory()` function to shrink incoming buffer and send queue when they exceed 2MB capacity - Track message count to trigger memory reclamation every 256 messages The incoming `ByteList` buffer and send queue `ArrayList` would grow but never shrink, causing memory accumulation during sustained IPC messaging. ## Test plan - [x] Added regression tests in `test/js/bun/spawn/spawn-ipc-memory.test.ts` - [x] Existing IPC tests pass (`spawn.ipc.test.ts`) - [x] Existing cluster tests pass 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com>	2025-12-19 17:27:09 -08:00
Angus Comrie	d9459f8540	Fix postgres empty check when handling arrays (#25607 ) ### What does this PR do? Closes #25505. This adjusts the byte length check in `DataCell: fromBytes` to 12 bytes instead of 16, as zero-dimensional arrays will have a shorter preamble. ### How did you verify your code works? Test suite passes, and I've added a new test that fails in the main branch but passes with this change. The issue only seems to crop up when a connection is _reused_, which is curious.	2025-12-19 14:49:12 -08:00
Jarred Sumner	e79b512a9d	Propagate debugger CLI config in single-file executables (#25600 )	2025-12-19 09:49:02 -08:00
robobun	9902039b1f	fix: memory leaks in error-handling code for Brotli, Zstd, and Zlib compression state machines (#25592 ) ## Summary Fix several memory leaks in the compression libraries: - NativeBrotli/NativeZstd reset() - Each call to `reset()` allocated a new encoder/decoder without freeing the previous one - NativeBrotli/NativeZstd init() error paths - If `setParams()` failed after `stream.init()` succeeded, the instance was leaked - NativeZstd init() - If `setPledgedSrcSize()` failed after context creation, the context was leaked - ZlibCompressorArrayList - After `deflateInit2_()` succeeded, if `ensureTotalCapacityPrecise()` failed with OOM, zlib internal state was never freed - NativeBrotli close() - Now sets state to null to prevent potential double-free (defensive) - LibdeflateState - Added `deinit()` for API consistency ## Test plan - [x] Added regression test that calls `reset()` 100k times and measures memory growth - [x] Test shows memory growth dropped from ~600MB to ~10MB for Brotli - [x] Verified no double-frees by tracing code paths - [x] Existing zlib tests pass (except pre-existing timeout in debug build) Before fix (system bun 1.3.3): ``` Memory growth after 100000 reset() calls: 624.38 MB (BrotliCompress) Memory growth after 100000 reset() calls: 540.63 MB (BrotliDecompress) ``` After fix: ``` Memory growth after 100000 reset() calls: 11.84 MB (BrotliCompress) Memory growth after 100000 reset() calls: 0.16 MB (BrotliDecompress) ``` 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com>	2025-12-18 21:42:14 -08:00
Dylan Conway	f3fd7506ef	fix(windows): handle UV_UNKNOWN and UV_EAI_* error codes in libuv errno mapping (#25596 ) ## Summary - Add missing `UV_UNKNOWN` and `UV_EAI_*` error code mappings to the `errno()` function in `ReturnCode` - Fixes panic "integer does not fit in destination type" on Windows when libuv returns unmapped error codes - Speculative fix for BUN-131E ## Root Cause The `errno()` function was missing mappings for `UV_UNKNOWN` (-4094) and all `UV_EAI_*` address info errors (-3000 to -3014). When libuv returned these codes, the switch fell through to `else => null`, and the caller at `sys_uv.zig:317` assumed success and tried to cast the negative return code to `usize`, causing a panic. This was triggered in `readFileWithOptions` -> `preadv` when: - Memory-mapped file operations encounter exceptions (file modified/truncated by another process, network drive issues) - Windows returns error codes that libuv cannot map to standard errno values ## Crash Report ``` Bun v1.3.5 (`1e86ceb`) on windows x86_64baseline [] panic: integer does not fit in destination type sys_uv.zig:294: preadv node_fs.zig:5039: readFileWithOptions ``` ## Test plan - [ ] This fix prevents a panic, converting it to a proper error. Testing would require triggering `UV_UNKNOWN` from libuv, which is difficult to do reliably (requires memory-mapped file exceptions or unusual Windows errors). 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-18 21:41:41 -08:00
Dylan Conway	57cbbc09e4	fix: correct off-by-one bounds checks in bundler and package installer (#25582 ) ## Summary - Fix two off-by-one bounds check errors that used `>` instead of `>=` - Both bugs could cause undefined behavior (array out-of-bounds access) when an index equals the array length ## The Bugs ### 1. `src/install/postinstall_optimizer.zig:62` ```zig // Before (buggy): if (resolution > metas.len) continue; const meta: *const Meta = &metas[resolution]; // Out-of-bounds when resolution == metas.len // After (fixed): if (resolution >= metas.len) continue; ``` ### 2. `src/bundler/linker_context/doStep5.zig:10` ```zig // Before (buggy): if (id > c.graph.meta.len) return; const resolved_exports = &c.graph.meta.items(.resolved_exports)[id]; // Out-of-bounds when id == c.graph.meta.len // After (fixed): if (id >= c.graph.meta.len) return; ``` ## Why These Are Bugs Valid array indices are `0` to `len - 1`. When `index == len`: - `index > len` evaluates to `false` → check passes - `array[index]` accesses `array[len]` → out-of-bounds / undefined behavior ## Codebase Patterns The rest of the codebase correctly uses `>=` for these checks: - `lockfile.zig:484`: `if (old_resolution >= old.packages.len) continue;` - `lockfile.zig:522`: `if (old_resolution >= old.packages.len) continue;` - `LinkerContext.zig:389`: `if (source_index >= import_records_list.len) continue;` - `LinkerContext.zig:1667`: `if (source_index >= c.graph.ast.len) {` ## Test plan - [x] Verified fix aligns with existing codebase patterns - [ ] CI passes 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude <noreply@anthropic.com>	2025-12-18 18:04:28 -08:00
Dylan Conway	8941a363c3	fix: dupe `ca` string in .npmrc to prevent use-after-free (#25563 ) ## Summary - Fix use-after-free bug when parsing `ca` option from `.npmrc` - The `ca` string was being stored directly from the parser's arena without duplication - Since the parser arena is freed at the end of `loadNpmrc`, this created a dangling pointer ## The Bug In `src/ini.zig`, the `ca` string wasn't being duplicated like all other string properties: ```zig // Lines 983-986 explicitly warn about this: // Need to be very, very careful here with strings. // They are allocated in the Parser's arena, which of course gets // deinitialized at the end of the scope. // We need to dupe all strings // Line 981: Parser arena is freed here defer parser.deinit(); // Line 1016-1020: THE BUG - string not duped! if (out.asProperty("ca")) |query| { if (query.expr.asUtf8StringLiteral()) |str| { install.ca = .{ .str = str, // ← Dangling pointer after parser.deinit()! }; ``` All other string properties in the same function correctly duplicate: - `registry` (line 996): `try allocator.dupe(u8, str)` - `cache` (line 1002): `try allocator.dupe(u8, str)` - `cafile` (line 1037): `asStringCloned(allocator)` - `ca` array items (line 1026): `asStringCloned(allocator)` ## User Impact When a user has `ca=<certificate>` in their `.npmrc` file: 1. The certificate string is parsed and stored 2. The parser arena is freed 3. `install.ca.str` becomes a dangling pointer 4. Later TLS/SSL operations access freed memory 5. Could cause crashes, undefined behavior, or security issues ## Test plan - Code inspection confirms this matches the pattern used for all other string properties - The fix adds `try allocator.dupe(u8, str)` to match `cache`, `registry`, etc. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude <noreply@anthropic.com>	2025-12-17 19:56:25 -08:00
Dylan Conway	722ac3aa5a	fix: check correct variable in subprocess stdin cleanup (#25562 ) ## Summary - Fix typo in `onProcessExit` where `existing_stdin_value.isCell()` was checked instead of `existing_value.isCell()` - Since `existing_stdin_value` is always `.zero` at that point, the condition was always false, making the inner block dead code ## The Bug In `src/bun.js/api/bun/subprocess.zig:593`: ```zig var existing_stdin_value = jsc.JSValue.zero; // Line 590 - always .zero if (this_jsvalue != .zero) { if (jsc.Codegen.JSSubprocess.stdinGetCached(this_jsvalue)) |existing_value| { if (existing_stdin_value.isCell()) { // BUG! Should be existing_value // This block was DEAD CODE - never executed ``` Compare with the correct pattern used elsewhere: ```zig // shell/subproc.zig:251-252 (CORRECT) if (jsc.Codegen.JSSubprocess.stdinGetCached(subprocess.this_jsvalue)) |existing_value| { jsc.WebCore.FileSink.JSSink.setDestroyCallback(existing_value, 0); // Uses existing_value } ``` ## Impact The dead code prevented: - Recovery of stdin from cached JS value when `weak_file_sink_stdin_ptr` is null - Proper cleanup via `onAttachedProcessExit` on the FileSink - `setDestroyCallback` cleanup in `onProcessExit` Note: The user-visible impact was mitigated by redundant cleanup paths in `Writable.zig` that also call `setDestroyCallback`. ## Test plan - Code inspection confirms this is a straightforward typo fix - Existing subprocess tests continue to pass 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude <noreply@anthropic.com>	2025-12-17 18:34:58 -08:00
Dylan Conway	a333d02f84	fix: correct inverted buffer allocation logic in Postgres array parsing (#25564 ) ## Summary - Fix inverted buffer allocation logic when parsing strings in Postgres arrays - Strings larger than 16KB were incorrectly using the stack buffer instead of dynamically allocating - This caused spurious `InvalidByteSequence` errors for valid data ## The Bug In `src/sql/postgres/DataCell.zig`, the condition for when to use dynamic allocation was inverted: ```zig // BEFORE (buggy): const needs_dynamic_buffer = str_bytes.len < stack_buffer.len; // TRUE when SMALL // AFTER (fixed): const needs_dynamic_buffer = str_bytes.len > stack_buffer.len; // TRUE when LARGE ``` ## What happened with large strings (>16KB): 1. `needs_dynamic_buffer` = false (e.g., 20000 < 16384 is false) 2. Uses `stack_buffer[0..]` which is only 16KB 3. `unescapePostgresString` hits bounds check and returns `BufferTooSmall` 4. Error converted to `InvalidByteSequence` 5. User gets error even though data is valid ## User Impact Users with Postgres arrays containing JSON or string elements larger than 16KB would get spurious InvalidByteSequence errors even though their data was perfectly valid. ## Test plan - Code inspection confirms the logic was inverted - The fix aligns with the intended behavior: use stack buffer for small strings, dynamic allocation for large strings 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude <noreply@anthropic.com>	2025-12-17 18:34:17 -08:00
Dylan Conway	c1acb0b9a4	fix(shell): prevent double-close of fd when using &> redirect with builtins (#25568 ) ## Summary - Fix double-close of file descriptor when using `&>` redirect with shell builtin commands - Add `dupeRef()` helper for cleaner reference counting semantics - Add tests for `&>` and `&>>` redirects with builtins ## Test plan - [x] Added tests in `test/js/bun/shell/file-io.test.ts` that reproduce the bug - [x] All file-io tests pass ## The Bug When using `&>` to redirect both stdout and stderr to the same file with a shell builtin command (e.g., `pwd &> file.txt`), the code was creating two separate `IOWriter` instances that shared the same file descriptor. When both `IOWriter`s were destroyed, they both tried to close the same fd, causing an `EBADF` (bad file descriptor) error. ```javascript import { $ } from "bun"; await $`pwd &> output.txt`; // Would crash with EBADF ``` ## The Fix 1. Share a single `IOWriter` between stdout and stderr when both are redirected to the same file, with proper reference counting 2. Rename `refSelf` to `dupeRef` for clarity across `IOReader`, `IOWriter`, `CowFd`, and add it to `Blob` for consistency 3. Fix the `Body.Value` blob case to also properly reference count when the same blob is assigned to multiple outputs 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Latest model <noreply@anthropic.com>	2025-12-17 18:33:53 -08:00
190n	fa5a5bbe55	fix: v8::Value::IsInt32()/IsUint32() edge cases (#25548 ) ### What does this PR do? - fixes both functions returning false for double-encoded values (even if the numeric value is a valid int32/uint32) - fixes IsUint32() returning false for values that don't fit in int32 - fixes the test from #22462 not testing anything (the native functions were being passed a callback to run garbage collection as the first argument, so it was only ever testing what the type check APIs returned for that function) - extends the test to cover the first edge case above ### How did you verify your code works? The new tests fail without these fixes. --------- Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>	2025-12-17 00:52:16 -08:00
robobun	bc47f87450	fix(ini): support env var expansion in quoted .npmrc values (#25518 ) ## Summary Fixes environment variable expansion in quoted `.npmrc` values and adds support for the `?` optional modifier. ### Changes Simplified quoted value handling: - Removed unnecessary `isProperlyQuoted` check that added complexity without benefit - When JSON.parse succeeds for quoted strings, expand env vars in the result - When JSON.parse fails for single-quoted strings like `'${VAR}'`, still expand env vars Added `?` modifier support (matching npm behavior): - `${VAR}` - if VAR is undefined, leaves as `${VAR}` (no expansion) - `${VAR?}` - if VAR is undefined, expands to empty string This applies consistently to both quoted and unquoted values. ### Examples ```ini # Env var found - all expand to the value token = ${NPM_TOKEN} token = "${NPM_TOKEN}" token = '${NPM_TOKEN}' # Env var NOT found - left as-is token = ${NPM_TOKEN} # → ${NPM_TOKEN} token = "${NPM_TOKEN}" # → ${NPM_TOKEN} token = '${NPM_TOKEN}' # → ${NPM_TOKEN} # Optional modifier (?) - expands to empty if not found token = ${NPM_TOKEN?} # → (empty) token = "${NPM_TOKEN?}" # → (empty) auth = "Bearer ${TOKEN?}" # → Bearer ``` ### Test Plan - Added 8 new tests for the `?` modifier covering quoted and unquoted values - Verified all expected values match `npm config get` behavior - All 30 ini tests pass 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com> Co-authored-by: Dylan Conway <dylan.conway567@gmail.com>	2025-12-16 19:49:23 -08:00
robobun	b135c207ed	fix(yaml): remove YAML 1.1 legacy boolean values for YAML 1.2 compliance (#25537 ) ## Summary - Remove YAML 1.1 legacy boolean values (`yes/no/on/off/y/Y`) that are not part of the YAML 1.2 Core Schema - Keep YAML 1.2 Core Schema compliant values: `true/True/TRUE`, `false/False/FALSE`, `null/Null/NULL`, `0x` hex, `0o` octal - Add comprehensive roundtrip tests for YAML 1.2 compliance Removed (now parsed as strings): - `yes`, `Yes`, `YES` (were `true`) - `no`, `No`, `NO` (were `false`) - `on`, `On`, `ON` (were `true`) - `off`, `Off`, `OFF` (were `false`) - `y`, `Y` (were `true`) This fixes a common pain point where GitHub Actions workflow files with `on:` keys would have the key parsed as boolean `true` instead of the string `"on"`. ## YAML 1.2 Core Schema Specification From [YAML 1.2.2 Section 10.3.2 Tag Resolution](https://yaml.org/spec/1.2.2/#1032-tag-resolution): | Regular expression | Resolved to tag | |-------------------|-----------------| | `null \| Null \| NULL \| ~` | tag:yaml.org,2002:null | | `/* Empty */` | tag:yaml.org,2002:null | | `true \| True \| TRUE \| false \| False \| FALSE` | tag:yaml.org,2002:bool | | `[-+]? [0-9]+` | tag:yaml.org,2002:int (Base 10) | | `0o [0-7]+` | tag:yaml.org,2002:int (Base 8) | | `0x [0-9a-fA-F]+` | tag:yaml.org,2002:int (Base 16) | | `[-+]? ( \. [0-9]+ \| [0-9]+ ( \. [0-9]* )? ) ( [eE] [-+]? [0-9]+ )?` | tag:yaml.org,2002:float | | `[-+]? ( \.inf \| \.Inf \| \.INF )` | tag:yaml.org,2002:float (Infinity) | | `\.nan \| \.NaN \| \.NAN` | tag:yaml.org,2002:float (Not a number) | Note: `yes`, `no`, `on`, `off`, `y`, `n` are not in the YAML 1.2 Core Schema boolean list. These were removed from YAML 1.1 as noted in [YAML 1.2 Section 1.2](https://yaml.org/spec/1.2.2/#12-yaml-history): > The YAML 1.2 specification was published in 2009. Its primary focus was making YAML a strict superset of JSON. It also removed many of the problematic implicit typing recommendations. 
## Test plan - [x] Updated existing YAML tests to reflect YAML 1.2 Core Schema behavior - [x] Added roundtrip tests (stringify → parse) for YAML 1.2 compliance - [x] Verified tests fail with system Bun (YAML 1.1 behavior) and pass with debug build (YAML 1.2) - [x] Run `bun bd test test/js/bun/yaml/yaml.test.ts` 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com>	2025-12-16 14:29:39 -08:00
Ciro Spaciari	a1dd26d7db	fix(usockets) fix last_write_failed flag (#25496 ) https://github.com/oven-sh/bun/pull/25361 needs to be merged before this PR ## Summary - Move `last_write_failed` flag from loop-level to per-socket flag for correctness ## Changes - Move `last_write_failed` from `loop->data` to `socket->flags` - More semantically correct since write status is per-socket, not per-loop 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-16 14:26:42 -08:00
robobun	dd04c57258	feat: implement V8 Value type checking APIs (#22462 ) ## Summary This PR implements four V8 C++ API methods for type checking that are commonly used by native Node.js modules: - `v8::Value::IsMap()` - checks if value is a Map - `v8::Value::IsArray()` - checks if value is an Array - `v8::Value::IsInt32()` - checks if value is a 32-bit integer - `v8::Value::IsBigInt()` - checks if value is a BigInt ## Implementation Details The implementation maps V8's type checking APIs to JavaScriptCore's equivalent functionality: - `IsMap()` uses JSC's `inherits<JSC::JSMap>()` check - `IsArray()` uses JSC's `isArray()` function with the global object - `IsInt32()` uses JSC's `isInt32()` method - `IsBigInt()` uses JSC's `isBigInt()` method ## Changes - Added method declarations to `V8Value.h` - Implemented the methods in `V8Value.cpp` - Added symbol exports to `napi.zig` (both Unix and Windows mangled names) - Added symbols to `symbols.txt` and `symbols.dyn` - Added comprehensive tests in `v8-module/main.cpp` and `v8.test.ts` ## Testing The implementation has been verified to: - Compile successfully without errors - Export the correct symbols in the binary - Follow established patterns in the V8 compatibility layer Tests cover various value types including empty and populated Maps/Arrays, different numeric ranges, BigInts, and other JavaScript types. 🤖 Generated with [Claude Code](https://claude.ai/code) --------- Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com>	2025-12-15 19:50:11 -08:00
robobun	344b2c1dfe	fix: Response.clone() no longer locks body when body was accessed before clone (#25484 ) ## Summary - Fix bug where `Response.clone()` would lock the original response's body when `response.body` was accessed before cloning - Apply the same fix to `Request.clone()` ## Root Cause When `response.body` was accessed before calling `response.clone()`, the original response's body would become locked after cloning. This happened because: 1. When the cloned response was wrapped with `toJS()`, `checkBodyStreamRef()` was called which moved the stream from `Locked.readable` to `js.gc.stream` and cleared `Locked.readable` 2. The subsequent code tried to get the stream from `Locked.readable`, which was now empty, so the body cache update was skipped 3. The JavaScript-level body property cache still held the old locked stream ## Fix Updated the cache update logic to: 1. For the cloned response: use `js.gc.stream.get()` instead of `Locked.readable.get()` since `toJS()` already moved the stream 2. 
For the original response: use `Locked.readable.get()` which still holds the teed stream since `checkBodyStreamRef` hasn't been called yet ## Reproduction ```javascript const readableStream = new ReadableStream({ start(controller) { controller.enqueue(new TextEncoder().encode("Hello, world!")); controller.close(); }, }); const response = new Response(readableStream); console.log(response.body?.locked); // Accessing body before clone const cloned = response.clone(); console.log(response.body?.locked); // Expected: false, Actual: true ❌ console.log(cloned.body?.locked); // Expected: false, Actual: false ✅ ``` ## Test plan - [x] Added regression tests for `Response.clone()` in `test/js/web/fetch/response.test.ts` - [x] Added regression test for `Request.clone()` in `test/js/web/request/request.test.ts` - [x] Verified tests fail with system bun (before fix) and pass with debug build (after fix) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Bot <claude-bot@bun.sh> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Jarred Sumner <jarred@jarredsumner.com>	2025-12-15 18:46:02 -08:00
Ciro Spaciari	aef0b5b4a6	fix(usockets): safely handle socket reallocation during context adoption (#25361 ) ## Summary - Fix use-after-free vulnerability during socket adoption by properly tracking reallocated sockets - Add safety checks to prevent linking closed sockets to context lists - Properly track socket state with new `is_closed`, `adopted`, and `is_tls` flags ## What does this PR do? This PR improves event loop stability by addressing potential use-after-free issues that can occur when sockets are reallocated during adoption (e.g., when upgrading a TCP socket to TLS). ### Key Changes Socket State Tracking ([internal.h](packages/bun-usockets/src/internal/internal.h)) - Added `is_closed` flag to explicitly track when a socket has been closed - Added `adopted` flag to mark sockets that were reallocated during context adoption - Added `is_tls` flag to track TLS socket state for proper low-priority queue handling Safe Socket Adoption ([context.c](packages/bun-usockets/src/context.c)) - When `us_poll_resize()` returns a new pointer (reallocation occurred), the old socket is now: - Marked as closed (`is_closed = 1`) - Added to the closed socket cleanup list - Marked as adopted (`adopted = 1`) - Has its `prev` pointer set to the new socket for event redirection - Added guards to `us_internal_socket_context_link_socket/listen_socket/connecting_socket` to prevent linking already-closed sockets Event Loop Handling ([loop.c](packages/bun-usockets/src/loop.c)) - After callbacks that can trigger socket adoption (`on_open`, `on_writable`, `on_data`), the event loop now checks if the socket was reallocated and redirects to the new socket - Low-priority socket handling now properly checks `is_closed` state and uses `is_tls` flag for correct SSL handling Poll Resize Safety ([epoll_kqueue.c](packages/bun-usockets/src/eventing/epoll_kqueue.c)) - Changed `us_poll_resize()` to always allocate new memory with `us_calloc()` instead of `us_realloc()` to ensure the old pointer 
remains valid for cleanup - Now takes `old_ext_size` parameter to correctly calculate memory sizes - Re-enabled `us_internal_loop_update_pending_ready_polls()` call in `us_poll_change()` to ensure pending events are properly redirected ### How did you verify your code works? Run existing CI and existing socket upgrade tests under asan build	2025-12-15 18:43:51 -08:00
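
The `?` modifier rules described in commit bc47f87450 (`${VAR}` is left as-is when the variable is undefined, `${VAR?}` expands to an empty string) can be sketched roughly as follows. This is a minimal illustration and not Bun's actual `src/ini.zig` logic: `expandEnv` and its `env` parameter are hypothetical names, and treating a variable set to the empty string as "defined" is an assumption the commit message does not state.

```typescript
// Hypothetical helper illustrating the expansion rules from the commit message.
// It is NOT Bun's implementation; the name and signature are made up for this sketch.
function expandEnv(
  value: string,
  env: Record<string, string | undefined>,
): string {
  // Matches ${NAME} and ${NAME?}; the trailing "?" marks the variable as optional.
  return value.replace(
    /\$\{([^}?]+)(\?)?\}/g,
    (match: string, name: string, optional: string | undefined): string => {
      const found = env[name];
      // Assumption: only `undefined` counts as "not found".
      if (found !== undefined) return found;
      // Optional and missing: expand to the empty string.
      // Required and missing: leave the original text untouched.
      return optional ? "" : match;
    },
  );
}
```

Passing the environment as a parameter, rather than reading a global, keeps the sketch easy to test; a caller would typically pass something like `process.env`.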


9112 Commits