test/cql-pytest: regression test for index scan with start token

When we have a table with partition key p and an indexed regular column v, the test included in this patch checks the query SELECT p FROM table WHERE v = 1 AND TOKEN(p) > 17 This can work and not require ALLOW FILTERING, because the secondary index posting-list of "v=1" is ordered in p's token order (to allow SELECT with and without an index to return the same order - this is explained in issue #7443). So this test should pass, and indeed it does on both current Scylla, and Cassandra. However, it turns out that this was a bug - issue #7043 - in older versions of Scylla, and only fixed in Scylla 4.6. In older versions, the SELECT wasn't accepted, claiming it requires ALLOW FILTERING, and if ALLOW FILTERING was added, the TOKEN(p) > 17 part was silently ignored. The fix for issue #7043 actually included regression tests, C++ tests in test/boost/secondary_index_test.cc. But in this patch we also add a Python test in test/cql-pytest. One of the benefits of cql-pytest is that we can (and I did) run the same test on Cassandra to verify we're not implementing a wrong feature. Another benefit is that we can run a new test on an old version, and not even require re-compilation: You can run this new test on any existing installation of Scylla to check if it still has issue #7043. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #12237
2026-05-17 05:12:13 +00:00 · 2022-12-07 23:27:55 +02:00
parent 7197757750
commit e47794ed98
1 changed files with 29 additions and 0 deletions
--- a/test/cql-pytest/test_secondary_index.py
+++ b/test/cql-pytest/test_secondary_index.py
@@ -1502,3 +1502,32 @@ def test_disallow_local_indexes_on_static_columns(scylla_only, cql, test_keyspac
    with new_test_table(cql, test_keyspace, schema) as table:
        with pytest.raises(InvalidRequest, match="Local indexes containing static columns are not supported"):
            cql.execute(f'CREATE INDEX ON {table}((pk), s)')
+
+# Check that a "SELECT p FROM table WHERE v = 1 AND token(p) > ..."
+# works where p is a base table's partition key and  v is an indexed
+# regular column. For this to work, it requires that a secondary index
+# should hold the list of matching p's in token order, and for the
+# query planner to understand this (so ALLOW FILTERING isn't necessary).
+# Reproduces issue #7043. See also C++ tests
+# test_select_with_token_range_*() in test/boost/secondary_index_test.cc
+def test_index_filter_by_token(cql, test_keyspace):
+    schema = 'p int, v int, primary key (p)'
+    with new_test_table(cql, test_keyspace, schema) as table:
+        cql.execute(f"CREATE INDEX ON {table}(v)")
+        insert = cql.prepare(f"INSERT INTO {table}(p,v) VALUES (?,?)")
+        for i in range(10):
+            # All of these have v = 1:
+            cql.execute(insert, [i, 1])
+        # Scan the base table in token order
+        base_results = list(cql.execute(f"SELECT token(p), p FROM {table}"))
+        # Scanning the secondary index yields the same order
+        # (see also test_partition_order_with_si() above)
+        si_results = list(cql.execute(f"SELECT token(p), p FROM {table} WHERE v = 1"))
+        assert si_results == base_results
+        # Find one of the tokens token and add a "token(p) >" restriction
+        # Check that the token(p) > .. restriction works (yielding just
+        # a subset of base_results) and also the v = ... works together
+        # (v=1 yields the data, v=0 yields nothing).
+        token3 = base_results[3].system_token_p
+        assert base_results[4:] == list(cql.execute(f"SELECT token(p), p FROM {table} WHERE v = 1 AND token(p) > {token3}"))
+        assert [] == list(cql.execute(f"SELECT token(p), p FROM {table} WHERE v = 0 AND token(p) > {token3}"))