What happened is that the cache's expireAfterWrite handler was being
called some number of milliseconds (say, 50-100) after the transaction
was started. That method used the transaction time instead of the
current time, so the entries were sticking around 50-100ms longer in
the cache than they should have been.
This fix contains two parts, each of which I believe would be sufficient
on its own to fix the issue (both are sketched below):
1. Use the currentTime parameter passed in to Expiry::expireAfterCreate
2. Use the transaction time in the cache's Ticker. This keeps everything
on the same schedule.
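For concreteness, here is a minimal sketch of both fixes against the Caffeine API; the CachedEntry value type and the transactionTimeMillis supplier are hypothetical stand-ins for the real cache value type and transaction clock:

```java
import com.github.benmanes.caffeine.cache.Cache;
import com.github.benmanes.caffeine.cache.Caffeine;
import com.github.benmanes.caffeine.cache.Expiry;
import com.github.benmanes.caffeine.cache.Ticker;
import java.util.concurrent.TimeUnit;
import java.util.function.LongSupplier;

final class TransactionAlignedCache {

  /** Builds a cache whose clock and expiry decisions both follow transaction time. */
  static Cache<String, CachedEntry> create(LongSupplier transactionTimeMillis) {
    // Fix 2: a Ticker backed by transaction time, so every cache-internal
    // timestamp is on the same schedule as the expiry decisions.
    Ticker transactionTicker =
        () -> TimeUnit.MILLISECONDS.toNanos(transactionTimeMillis.getAsLong());

    return Caffeine.newBuilder()
        .ticker(transactionTicker)
        .expireAfter(
            new Expiry<String, CachedEntry>() {
              @Override
              public long expireAfterCreate(String key, CachedEntry value, long currentTime) {
                // Fix 1: measure the remaining lifetime from the currentTime
                // argument (read from the Ticker above), not from a
                // separately-read wall clock.
                return TimeUnit.MILLISECONDS.toNanos(value.expiresAtMillis()) - currentTime;
              }

              @Override
              public long expireAfterUpdate(
                  String key, CachedEntry value, long currentTime, long currentDuration) {
                return expireAfterCreate(key, value, currentTime);
              }

              @Override
              public long expireAfterRead(
                  String key, CachedEntry value, long currentTime, long currentDuration) {
                return currentDuration; // reads do not extend the lifetime
              }
            })
        .build();
  }

  /** Hypothetical value type carrying its own expiration instant. */
  record CachedEntry(long expiresAtMillis) {}
}
```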
During the release process, we are seeing the message "Gradle build daemon
disappeared unexpectedly (it may have been killed or may have crashed)",
which can seemingly be caused by OOMs.
I analyzed the SQL statements run during the following flows and EXPLAIN
ANALYZEd each of them to figure out whether there are any additional hash
indexes we could add that would be particularly helpful. Note: it's not
worth adding a hash index on the host_repo_id field in DomainHost
because so many rows (domains) use the same host.
- domain create
- domain delete
- domain info
- domain renew
- domain update
- host create
- host delete
- host update
I skipped the flows that use the read-only replica, as well as the contact
flows (we're getting rid of them) and the domain transfer/restore-related
flows, as those are extremely infrequent.
Updates (AKA merges) run an extra SELECT statement to figure out if the
resource exists so that Hibernate can merge the entity into the existing
object in its session. When we're inserting new rows (such as new poll
messages or resource creates), we know that we don't need to do that
merge. Doing this should save us some SELECT statements (this has been
borne out in alpha).
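A minimal sketch of the distinction in plain JPA terms; the helper names here are hypothetical:

```java
import javax.persistence.EntityManager;

final class EntitySaver {

  /** For rows we know are new (e.g. poll messages, resource creates). */
  static void saveNew(EntityManager em, Object entity) {
    // persist() schedules a plain INSERT; no existence-check SELECT is issued.
    em.persist(entity);
  }

  /** For rows that may already exist. */
  static Object save(EntityManager em, Object entity) {
    // merge() first SELECTs the current row so Hibernate can copy the
    // detached state onto the managed instance, then UPDATEs it.
    return em.merge(entity);
  }
}
```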
This should help in instances of popular domains dropping, since we
won't need to do an additional two database loads every time (assuming
the deletion time is in the future).
When running the action in sandbox on 1.5M domains, it failed a few times
while updating individual domains (requiring a manual restart of the entire
action). It's better to just log the individual failures for manual
inspection and then otherwise continue running the action to process the
vast majority of other updates that won't fail.
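A minimal sketch of the log-and-continue behavior; updateDomain and the domain list are hypothetical stand-ins for the action's real update step:

```java
import com.google.common.flogger.FluentLogger;

final class ResaveDomainsSketch {
  private static final FluentLogger logger = FluentLogger.forEnclosingClass();

  void run(Iterable<String> domainNames) {
    for (String domainName : domainNames) {
      try {
        updateDomain(domainName);
      } catch (RuntimeException e) {
        // Log for manual inspection; don't let one bad domain kill the action.
        logger.atSevere().withCause(e).log("Failed to update domain %s", domainName);
      }
    }
  }

  private void updateDomain(String domainName) {
    // Stand-in for the real per-domain update logic.
  }
}
```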
BUG = http://b/439636188
Add a flag to the CreateCdnsTld command to bypass the DNS name format
check in Sandbox (which limits names to `*.test.`). With this flag, we
can create TLDs for RST testing in Sandbox.
Note that if the new flag is wrongly set for a disallowed name, the
request to the Cloud DNS API will fail; the format check in the command
just provides a user-friendly error message.
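A hedged sketch of the new flag in the JCommander style the registry tool commands use; the flag name, class shape, and check method are assumptions:

```java
import com.beust.jcommander.Parameter;
import com.beust.jcommander.Parameters;

@Parameters(commandDescription = "Create a Cloud DNS TLD")
class CreateCdnsTldSketch {

  @Parameter(
      names = "--bypass_name_format_check", // hypothetical flag name
      description = "Skip the Sandbox check limiting DNS names to *.test.")
  boolean bypassNameFormatCheck = false;

  void validateDnsName(String dnsName) {
    if (!bypassNameFormatCheck && !dnsName.endsWith(".test.")) {
      // Fail fast with a friendly message; a disallowed name would be
      // rejected by the Cloud DNS API later anyway.
      throw new IllegalArgumentException("Sandbox DNS names must match *.test.");
    }
  }
}
```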
I went through all the SQL statements generated by some sample
DomainCreateFlow and DomainDeleteFlow cases to find situations where we
were either SELECTing from, or UPDATEing, tables with a direct "field =
value" format. These are the situations that I found where we can add
hash indexes. This does two things:
1. Makes these queries slightly faster, since these are usually queries on
columns that are either unique or very close to unique, and O(1) is
faster than O(log(n))
2. Spreads around the optimistic predicate locks on the previously-used
btree indexes. Many of our serialization errors came from the fact
that we were autogenerating incrementing ID values for various
tables, meaning that SELECTs, INSERTs, and UPDATEs would all try to
take predicate locks out on the same page of the btree index. Using a
hash index means that the page locks will be spread out to various
index pages, rather than conflicting with each other.
Running load tests on alpha, I see significant improvements in speed and
error rates. Speed is hard to quantify due to the way the load tests
distribute tasks among the queues, but it could be more than a 50%
improvement, and serialization errors in the logs drop by more than
90%.
This implements the first part of Minimum Data Set phase 3, wherein we delete
all contact data. This action is necessary to leave a permanent record on the
domain (in the form of a domain history entry) documenting when the contacts
were removed by the administrative user.
Then, after this has finished removing all contact associations, we can simply
empty out or drop the Contact/ContactHistory tables and associated join tables.
* Skip user loading for proxy service account
Reduces database load by skipping the User entity lookup for the proxy
service account during OIDC authentication.
The high volume of EPP "hello" and "login" commands from the proxy
service account results in a constant database load. These lookups
are unnecessary as the proxy service account is not expected to have a
corresponding User object.
This change optimizes the authentication flow by checking for the proxy
service account email *before* attempting to load a User from the
database. This bypasses the database transaction entirely for these
high-volume requests.
This approach is more efficient than caching, as it eliminates the
database lookup for the proxy service account altogether, rather than
just caching the result.
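A hedged sketch of the short-circuit; the names here are hypothetical stand-ins for the real OIDC authentication mechanism:

```java
import java.util.Optional;

final class OidcAuthSketch {
  private final String proxyServiceAccountEmail;

  OidcAuthSketch(String proxyServiceAccountEmail) {
    this.proxyServiceAccountEmail = proxyServiceAccountEmail;
  }

  Optional<AuthLevel> authenticate(String email) {
    // Check for the proxy service account *before* any database access, so
    // the high-volume hello/login traffic never opens a transaction.
    if (proxyServiceAccountEmail.equals(email)) {
      return Optional.of(AuthLevel.APP);
    }
    // Everyone else still pays for the User lookup.
    return loadUser(email).map(user -> AuthLevel.USER);
  }

  private Optional<User> loadUser(String email) {
    return Optional.empty(); // stand-in for the real database lookup
  }

  enum AuthLevel { APP, USER }
  static final class User {}
}
```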
* Comment added and service account lookup time improved
* Comment updated for more clarity
* Add cache for User entities in OIDC auth flow
* refactor: Address review feedback
- Refactor the database call into a single, reusable method
- Increase the default cache size to 200 (see the sketch below)
- Remove .recordStats() and use a Mockito spy for testing instead
- Split the unit tests into a separate implementation test that uses Mockito spies instead of checking internal cache stats
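A minimal sketch of the resulting cache shape, assuming Caffeine; the TTL and loader method name are assumptions, not the actual values used:

```java
import com.github.benmanes.caffeine.cache.Caffeine;
import com.github.benmanes.caffeine.cache.LoadingCache;
import java.time.Duration;
import java.util.Optional;

final class UserCacheSketch {
  private final LoadingCache<String, Optional<User>> userCache =
      Caffeine.newBuilder()
          .maximumSize(200) // the increased default size from the review feedback
          .expireAfterWrite(Duration.ofMinutes(1)) // hypothetical TTL
          .build(this::loadUserFromDatabase); // the single, reusable DB method

  private Optional<User> loadUserFromDatabase(String email) {
    return Optional.empty(); // stand-in for the real database lookup
  }

  static final class User {}
}
```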
We've moved these over to the User class, so we should remove them for
clarity. In addition, we should make it clear (in Java at least) that
the field in the RegistryLock object refers to the email address used
for the lock in question.
This allows us to check / modify the CharlestonRoad registrar in the
console, and also to test actions (like password reset) using that
registrar in the prod environment.
* Fix OOM in UploadBsaUnavailableDomains action
The action was using string concatenation to generate the upload content.
This causes an OOM when the string length exceeds 25MB on our current VM.
This PR switches to streaming upload.
Also added an HTTP upload test.
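A hedged sketch of the streaming approach; the names are hypothetical. The key change is writing each domain straight to the request body rather than first accumulating a 25MB+ String:

```java
import static java.nio.charset.StandardCharsets.UTF_8;

import java.io.BufferedWriter;
import java.io.IOException;
import java.io.OutputStreamWriter;
import java.io.Writer;
import java.net.HttpURLConnection;

final class StreamingUploadSketch {
  static void streamUnavailableDomains(HttpURLConnection connection, Iterable<String> domains)
      throws IOException {
    connection.setDoOutput(true);
    connection.setChunkedStreamingMode(0); // stream instead of buffering the whole body
    try (Writer out =
        new BufferedWriter(new OutputStreamWriter(connection.getOutputStream(), UTF_8))) {
      for (String domain : domains) {
        out.write(domain);
        out.write('\n');
      }
    }
  }
}
```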
The error happened in the case where an unblockable name reported with
'Registered' as the reason has been deregistered. We tried to check the
deletion time of the domain to decide if this is a transient error that
is not worth reporting. However, we forgot that we do not have the
domain key in this case.
Since this is a best-effort action and the case rarely happens, we
decided not to make the optimization (staleness check) in this case.
Note: this still includes "contacts" for registrars, which are actually
a different concept that we call RegistrarPoc. That's different from
"Contact" objects, e.g. registrant.
The TLD is technically valid, but it doesn't exist for us -- we should
return 404 instead of 400 in these situations, according to the RDAP
conformance docs.
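A minimal sketch of the intended mapping; existingTlds and NotFoundException are hypothetical stand-ins for the real TLD lookup and the exception type that maps to an HTTP 404:

```java
// Hypothetical names: existingTlds is the set of TLDs we serve, and
// NotFoundException maps to HTTP 404 in the response layer.
if (!existingTlds.contains(requestedTld)) {
  // Syntactically valid but unknown to us: 404 per the RDAP conformance
  // docs, rather than 400 (which would imply a malformed request).
  throw new NotFoundException(requestedTld + " not found");
}
```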
Load balancer / internal redirections can result in the final request
URL lacking "https" by the time it reaches the servlet. As a result,
even if you use https in the request, the resulting URL can be plain
http.
We need to include the actual (HTTPS) URL in the output, so replace it.
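A minimal sketch of the replacement, assuming a servlet-style request object:

```java
String url = request.getRequestURL().toString();
if (url.startsWith("http://")) {
  // TLS terminates at the load balancer, so the servlet sees plain http;
  // advertise the real externally-visible https URL instead.
  url = "https://" + url.substring("http://".length());
}
```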
This increases the Hikari fetch size from 20 to 40 in order to decrease
the number of round trips.
This also sets a lower CPU allocation, as we seem to have overshot CPU
consumption.
This also sets min replicas to 8 for EPP and max to 16, as we've been
running on 8-10 for the last week.
Tested locally and on alpha with dummy values (and throwing an
exception).
I was able to reuse a bit of code from the EPP password reset, but not
all of it.
This is the first in a series of PRs to implement the expiry access period
(XAP). The overall fee schedules will be set in YAML config files, so the only
DB change necessary should be this single new boolean column on the Tld entity,
which defaults to false so that XAP must be explicitly turned on for a given
TLD.
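A hedged sketch of what that column might look like on the entity; the field name is an assumption:

```java
import javax.persistence.Column;

// Hypothetical field name; the real column name may differ. Defaulting to
// false means XAP stays off unless explicitly enabled per TLD.
@Column(nullable = false)
boolean expiryAccessPeriodEnabled = false;
```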
BUG=http://b/437398822
PostgreSQL 17.6 introduces two new lines in pg_dump output as a
security feature: `\restricted {HASH}` and `\unrestricted {HASH}`.
We filter out lines starting with these two prefixes when comparing
schemas.
The db upgrade also adds two empty lines to the pg_dump output. We
now ignore all empty lines when comparing schemas.
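A minimal sketch of the comparison filter, assuming the dumps are compared line by line (the method name is hypothetical):

```java
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Drop the new security-feature lines and all blank lines before diffing.
static List<String> normalizeDumpForComparison(Stream<String> dumpLines) {
  return dumpLines
      .filter(line -> !line.isBlank())
      .filter(line -> !line.startsWith("\\restricted"))
      .filter(line -> !line.startsWith("\\unrestricted"))
      .collect(Collectors.toList());
}
```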
It's necessary to remove the GAE-related code (and use GKE launch
commands instead), and we might as well remove contact-related fields
and actions because of the upcoming move to the minimum data set.
This reverts commit 5cef2dd8b5.
We faced a CPU quota issue with the standard machine type, so we are rolling
back to c4 for now to monitor server performance and decide if we want to try
the downgrade again in the future.
We probably want this to run before the billing recurrence expansion
pipeline just in case there are any domains that should be deleted
before their billing recurrence gets expanded.
nodeSelector can limit the scheduling capabilities of k8s, which leads to delays in assigning new workloads. Since we do not require any particular machine for execution, it can be removed.
* Fix: Robustly parse certs and provide specific errors
* Add test for expired certificate failure
* fixing indentation
* fixing indentation
* Update SecurityActionTest.java
* Update SecurityActionTest.java to correct the test case
* Fix: Provide indentation fix
* Fix deduplication in test
All deployments received an update to averageUtilization CPU. This should allow us to stay ahead of the traffic curve and create instances before CPU reaches the limit.
Frontend CPU allocation has caused a "noisy neighbors" problem, with pods assigned to nodes that don't have enough bursting capacity, so I increased it.
Adjusted the rest of the deployments according to their utilization.