seaweedfs

mirrors/seaweedfs

Fork 0

mirror of https://github.com/seaweedfs/seaweedfs.git synced 2026-07-25 01:22:39 +00:00

Files

T

History

Chris LuandGitHub 5fbe39320c fix(volume_server): pin EC shard auto-select to the .ecx-owning disk (#9212 ) (#9245 )

* fix(volume_server): pin EC shard auto-select to the .ecx-owning disk (#9212)

ec.rebuild only sets CopyEcxFile=true on the first shard sent to the
rebuilder; subsequent shards rely on VolumeEcShardsCopy / ReceiveFile
auto-select to land on the same disk. The old auto-select used
FindEcVolume (in-memory) to detect the "already has this volume" case.
Mid-rebuild, no EC volume has been mounted yet on the destination, so
FindEcVolume returns nothing and the fallback picks "any HDD with free
space" — which can split shards from their .ecx across disks of the
same node and feed the orphan-shard layout reported in #9212 / fixed
on the loader side in #9244.

Add Store.FindEcShardTargetLocation as the canonical placement
primitive: prefer a mounted EC volume, then a disk that has the .ecx
on disk, then any HDD, then any disk. DiskLocation.HasEcxFileOnDisk is
the new on-disk check, and it looks at IdxDirectory first with a
fallback to Directory to handle .ecx written before -dir.idx was
configured.

Both VolumeEcShardsCopy and ReceiveFile now route through the new
helper, dropping their duplicated 4-level fallback ladder. No protocol
changes; explicit DiskId callers are unaffected.

* fix(volume_server): treat directories named *.ecx as no-match in HasEcxFileOnDisk

os.Stat(".ecx") succeeds for both files and directories. If something
happens to leave a directory named X.ecx in the data or idx folder,
HasEcxFileOnDisk would currently report true and FindEcShardTargetLocation
would route shards to that disk — where NewEcVolume's eventual
OpenFile(O_RDWR) on the same path errors out.

Add a !info.IsDir() check on both stat sites. Cheap and conservative.

Suggested in PR #9245 review by @gemini-code-assist.

* refactor(volume_server): collapse EC placement helper to a single pass

FindEcShardTargetLocation called FindFreeLocation up to four times. Each
call iterates s.Locations and acquires VolumesLen / EcShardCount RLocks
per disk — for a typical 4-disk node that's 32 RLock cycles per
placement decision.

Walk s.Locations once, score each disk by tier (mounted > .ecx-on-disk
> HDD > any-disk), break ties by free count. The free-slot math is
factored into a small helper that mirrors FindFreeLocation's formula
without re-entering the location's locks. Behaviour is unchanged: each
existing tier still wins over later tiers, and within a tier the disk
with the most free count still wins, matching the original max-tracking
in FindFreeLocation.

Suggested in PR #9245 review by @gemini-code-assist.

* refactor(volume_server): thread dataShardCount as a parameter through EC placement

ecFreeShardCount and FindEcShardTargetLocation referenced
erasure_coding.DataShardsCount directly. Take it as a parameter so
custom-ratio builds (e.g. enterprise) can swap the default without
touching the helper itself, and so unit tests can pin a specific ratio
independent of the package constant. Default callsites in
VolumeEcShardsCopy and ReceiveFile now pass the package default
explicitly; tests pass a literal 10 for clarity.

* fix(volume_server): treat MaxVolumeCount=0 as unlimited in EC placement

ecFreeShardCount computed `MaxVolumeCount - VolumesLen()` and went
negative when MaxVolumeCount was 0 — the "unlimited disk" sentinel
already honoured by Store.hasFreeDiskLocation and friends. With a
negative free count, FindEcShardTargetLocation's `freeCount <= 0`
guard skipped the disk entirely, so unlimited disks could never receive
EC shards via the placement helper.

Special-case MaxVolumeCount<=0: report a synthetic large free count
that decrements with current usage, so unlimited disks are eligible
and tie-breaks still prefer the less-loaded one. Added
TestFindEcShardTargetLocation_HonoursUnlimitedDisk as the regression.

Reported in PR #9245 review by @gemini-code-assist.

* fix(volume_server): account in shard slots, not volume slots, in ecFreeShardCount

FindFreeLocation in store.go ends with `free /= DataShardsCount`,
converting "shard slots free" back to "volume-equivalent slots." The
truncation is harmless there, but my new ecFreeShardCount inherited
the same final divide and re-introduced exactly the orphan-shard
hazard #9245 was meant to prevent: with MaxVolumeCount=1,
VolumesLen=0, EcShardCount=1 the formula reports 0 even though the
disk has room for 9 more shards, so subsequent shards route off the
.ecx-owning disk into the HDD-fallback tier.

Drop the trailing divide and return the count directly in shard slots.
Same shape, finer granularity; tie-breaks still order by free count.
The unlimited branch's "used" calculation is updated to match (mix
volume-slots and shard-slots in shard units). Added
TestFindEcShardTargetLocation_TightProvisioningKeepsEcxDisk as the
regression.

Reported in PR #9245 review by @coderabbitai.

2026-04-27 15:59:57 -07:00

constants

Nit: use time.Durations instead of constants in seconds. (#7438 )

2025-11-04 13:02:22 -08:00

filer_ui

chore: execute goimports to format the code (#7983 )

2026-01-07 13:06:08 -08:00

master_ui

feat: improve aio support for admin/volume ingress and fix UI links (#8679 )

2026-03-18 13:20:55 -07:00

nfs

fix(nfs): make Linux mount -t nfs work without client workaround (#9199 ) (#9201 )

2026-04-23 13:53:53 -07:00

postgres

chore: remove ~50k lines of unreachable dead code (#8913 )

2026-04-03 16:04:27 -07:00

volume_server_ui

fix: EC UI template error when viewing shard details (#7955 )

2026-01-03 22:45:48 -08:00

common_test.go

…

common.go

fix(filer): return 503 + Retry-After when remote object not cached yet (#9236 )

2026-04-27 01:58:33 -07:00

filer_grpc_server_admin.go

chore: execute goimports to format the code (#7983 )

2026-01-07 13:06:08 -08:00

filer_grpc_server_dlm_test.go

dlm: resilient distributed locks via consistent hashing + backup replication (#8860 )

2026-03-30 23:29:56 -07:00

filer_grpc_server_dlm.go

dlm: resilient distributed locks via consistent hashing + backup replication (#8860 )

2026-03-30 23:29:56 -07:00

filer_grpc_server_kv.go

chore: execute goimports to format the code (#7983 )

2026-01-07 13:06:08 -08:00

filer_grpc_server_mount_peer_test.go

chore(filer): remove -mount.p2p flag; registry is always on (#9183 )

2026-04-21 23:00:11 -07:00

filer_grpc_server_mount_peer.go

chore(filer): remove -mount.p2p flag; registry is always on (#9183 )

2026-04-21 23:00:11 -07:00

filer_grpc_server_remote.go

fix(filer/remote): keep re-cache work alive past caller cancellation (#9174 ) (#9193 )

2026-04-22 17:56:15 -07:00

filer_grpc_server_rename_test.go

Adjust rename events metadata format (#8854 )

2026-03-30 18:25:11 -07:00

filer_grpc_server_rename.go

[nfs] Add NFS (#9067 )

2026-04-14 20:48:24 -07:00

filer_grpc_server_stream_mutate_bench_test.go

perf(filer): parallelize StreamMutateEntry with path-keyed scheduler (#9171 )

2026-04-21 11:25:09 -07:00

filer_grpc_server_stream_mutate_scheduler_test.go

perf(filer): parallelize StreamMutateEntry with path-keyed scheduler (#9171 )

2026-04-21 11:25:09 -07:00

filer_grpc_server_stream_mutate_scheduler.go

perf(filer): parallelize StreamMutateEntry with path-keyed scheduler (#9171 )

2026-04-21 11:25:09 -07:00

filer_grpc_server_stream_mutate.go

perf(filer): parallelize StreamMutateEntry with path-keyed scheduler (#9171 )

2026-04-21 11:25:09 -07:00

filer_grpc_server_sub_meta_test.go

fix(tests): make tests pass on 32-bit architectures (#9168 ) (#9170 )

2026-04-20 22:48:01 -07:00

filer_grpc_server_sub_meta.go

fix(filer): eliminate redundant disk reads causing memory/CPU regression (#9039 )

2026-04-11 23:12:54 -07:00

filer_grpc_server_test.go

fix(filer): apply default disk type after location-prefix resolution in gRPC AssignVolume (#8836 )

2026-03-29 14:18:24 -07:00

filer_grpc_server_traverse_meta_test.go

chore: execute goimports to format the code (#7983 )

2026-01-07 13:06:08 -08:00

filer_grpc_server_traverse_meta.go

Add error list each entry func (#7485 )

2025-11-25 19:35:19 -08:00

filer_grpc_server.go

feat: pass expected_data_size from clients for size-aware assignment (#9032 )

2026-04-11 11:30:47 -07:00

filer_jwt_test.go

fix Filer startup failure due to JWT on / path #8149 (#8167 )

2026-01-29 21:45:15 -08:00

filer_server_handlers_copy_test.go

Use filer-side copy for mounted whole-file copy_file_range (#8747 )

2026-03-23 18:35:15 -07:00

filer_server_handlers_copy.go

fix(mount): remove fid pool to stop master over-allocating volumes (#9111 )

2026-04-16 15:51:13 -07:00

filer_server_handlers_iam_grpc.go

fix(s3): include static identities in listing operations (#8903 )

2026-04-03 20:01:28 -07:00

filer_server_handlers_proxy_test.go

fix(filer): limit concurrent proxy reads per volume server (#8608 )

2026-03-11 23:32:09 -07:00

filer_server_handlers_proxy.go

chore: remove ~50k lines of unreachable dead code (#8913 )

2026-04-03 16:04:27 -07:00

filer_server_handlers_read_dir.go

chore: execute goimports to format the code (#7983 )

2026-01-07 13:06:08 -08:00

filer_server_handlers_read.go

fix(filer): return 503 + Retry-After when remote object not cached yet (#9236 )

2026-04-27 01:58:33 -07:00

filer_server_handlers_tagging.go

Changes logging function (#6919 )

2025-06-24 08:44:06 -07:00

filer_server_handlers_write_autochunk.go

fix(mount): remove fid pool to stop master over-allocating volumes (#9111 )

2026-04-16 15:51:13 -07:00

filer_server_handlers_write_merge.go

S3 API: Add SSE-KMS (#7144 )

2025-08-21 08:28:07 -07:00

filer_server_handlers_write_upload.go

fix(mount): remove fid pool to stop master over-allocating volumes (#9111 )

2026-04-16 15:51:13 -07:00

filer_server_handlers_write.go

fix(mount): remove fid pool to stop master over-allocating volumes (#9111 )

2026-04-16 15:51:13 -07:00

filer_server_handlers.go

fix Filer startup failure due to JWT on / path #8149 (#8167 )

2026-01-29 21:45:15 -08:00

filer_server_rocksdb.go

go fix

2026-02-20 18:42:00 -08:00

filer_server_tus_handlers.go

fix(mount): remove fid pool to stop master over-allocating volumes (#9111 )

2026-04-16 15:51:13 -07:00

filer_server_tus_session.go

Add TUS protocol support for resumable uploads (#7592 )

2025-12-14 21:56:07 -08:00

filer_server.go

chore(filer): remove -mount.p2p flag; registry is always on (#9183 )

2026-04-21 23:00:11 -07:00

master_grpc_server_admin.go

Fix stale admin lock metric when lock expires and is reacquired (#8859 )

2026-03-30 18:51:38 -07:00

master_grpc_server_assign.go

feat: pass expected_data_size from clients for size-aware assignment (#9032 )

2026-04-11 11:30:47 -07:00

master_grpc_server_cluster.go

chore: execute goimports to format the code (#7983 )

2026-01-07 13:06:08 -08:00

master_grpc_server_collection.go

move to https://github.com/seaweedfs/seaweedfs

2022-07-29 00:17:28 -07:00

master_grpc_server_raft_test.go

Add cluster.raft.leader.transfer command for graceful leader change (#7819 )

2025-12-19 00:15:39 -08:00

master_grpc_server_raft.go

fix(admin): fix master leader link showing incorrect port in Admin UI (#8924 )

2026-04-04 11:50:43 -07:00

master_grpc_server_test.go

dlm: resilient distributed locks via consistent hashing + backup replication (#8860 )

2026-03-30 23:29:56 -07:00

master_grpc_server_volume.go

feat(master): drain pending size before marking volume readonly (#9036 )

2026-04-11 18:29:11 -07:00

master_grpc_server.go

dlm: resilient distributed locks via consistent hashing + backup replication (#8860 )

2026-03-30 23:29:56 -07:00

master_server_handlers_admin.go

fix: generate topology uuid uniformly in single-master mode (#8405 )

2026-02-22 23:45:48 -08:00

master_server_handlers_ui.go

hide millseconds in up time (#7553 )

2025-11-26 08:01:19 -08:00

master_server_handlers.go

feat: pass expected_data_size from clients for size-aware assignment (#9032 )

2026-04-11 11:30:47 -07:00

master_server.go

fix(wdclient,volume): compare master leader with ServerAddress.Equals (#9089 )

2026-04-15 12:29:31 -07:00

raft_common.go

fix: improve raft leader election reliability and failover speed (#8692 )

2026-03-18 23:28:07 -07:00

raft_hashicorp_test.go

Normalize hashicorp raft peer ids (#8253 )

2026-02-09 07:46:34 -08:00

raft_hashicorp.go

fix: improve raft leader election reliability and failover speed (#8692 )

2026-03-18 23:28:07 -07:00

raft_server_handlers.go

fix(wdclient,volume): compare master leader with ServerAddress.Equals (#9089 )

2026-04-15 12:29:31 -07:00

raft_server.go

go fmt

2026-04-10 17:31:14 -07:00

volume_grpc_admin.go

Give the ScrubVolume() RPC an option to flag found broken volumes as read-only. (#8360 )

2026-03-26 10:20:57 -07:00

volume_grpc_batch_delete.go

Block RPC write operations on volume servers when maintenance mode is enabled (#8115 )

2026-02-02 13:21:02 -08:00

volume_grpc_client_to_master.go

Export master_disconnections metrics on volume servers. (#9104 )

2026-04-17 15:15:26 -07:00

volume_grpc_copy_incremental.go

move to https://github.com/seaweedfs/seaweedfs

2022-07-29 00:17:28 -07:00

volume_grpc_copy.go

fix(volume_server): pin EC shard auto-select to the .ecx-owning disk (#9212 ) (#9245 )

2026-04-27 15:59:57 -07:00

volume_grpc_erasure_coding_test.go

iceberg: wire pagination for list namespaces/tables REST APIs (#8275 )

2026-02-09 21:46:55 -08:00

volume_grpc_erasure_coding.go

fix(volume_server): pin EC shard auto-select to the .ecx-owning disk (#9212 ) (#9245 )

2026-04-27 15:59:57 -07:00

volume_grpc_query.go

move to https://github.com/seaweedfs/seaweedfs

2022-07-29 00:17:28 -07:00

volume_grpc_read_all.go

Export gRPC file_{read,write}_failures metrics on volume servers. (#9177 )

2026-04-22 11:22:21 -07:00

volume_grpc_read_write.go

Export gRPC file_{read,write}_failures metrics on volume servers. (#9177 )

2026-04-22 11:22:21 -07:00

volume_grpc_remote.go

improve: large file sync throughput for remote.cache and filer.sync (#8676 )

2026-03-17 16:49:56 -07:00

volume_grpc_scrub.go

Give the ScrubVolume() RPC an option to flag found broken volumes as read-only. (#8360 )

2026-03-26 10:20:57 -07:00

volume_grpc_state.go

Add a version token on RPCs to read/update volume server states. (#8191 )

2026-02-06 10:58:43 -08:00

volume_grpc_tail.go

Block RPC write operations on volume servers when maintenance mode is enabled (#8115 )

2026-02-02 13:21:02 -08:00

volume_grpc_tier_download.go

avoid load volume file with BytesOffset mismatch (#3841 )

2022-10-14 00:18:09 -07:00

volume_grpc_tier_upload.go

Block RPC write operations on volume servers when maintenance mode is enabled (#8115 )

2026-02-02 13:21:02 -08:00

volume_grpc_vacuum.go

Block RPC write operations on volume servers when maintenance mode is enabled (#8115 )

2026-02-02 13:21:02 -08:00

volume_server_handlers_admin.go

chore: remove ~50k lines of unreachable dead code (#8913 )

2026-04-03 16:04:27 -07:00

volume_server_handlers_helper.go

…

volume_server_handlers_read.go

Export file_read_invalid_needles metric for REST read requests on invalid file IDs. (#9241 )

2026-04-27 12:22:42 -07:00

volume_server_handlers_ui.go

hide millseconds in up time (#7553 )

2025-11-26 08:01:19 -08:00

volume_server_handlers_write.go

Export REST file_{read,write}_failures metrics on volume servers (#9215 )

2026-04-24 11:45:21 -07:00

volume_server_handlers.go

fix: JWT validation failures during replication (#7788 ) (#7795 )

2025-12-16 13:42:18 -08:00

volume_server.go

Export start_time_seconds metrics on both master & volume servers. (#9046 )

2026-04-13 09:34:08 -07:00

webdav_server.go

fix(mount): remove fid pool to stop master over-allocating volumes (#9111 )

2026-04-16 15:51:13 -07:00

wrapped_webdav_fs.go

chore: execute goimports to format the code (#7983 )

2026-01-07 13:06:08 -08:00