anonymousland-synapse

mirror of https://git.anonymousland.org/anonymousland/synapse.git synced 2024-12-24 03:59:22 -05:00

Author	SHA1	Message	Date
Sean Quah	373c485d8c	Handle half-created indices in receipts index background update (#14650 ) When Synapse is terminated while running the background update to create the `receipts_graph` or `receipts_linearized` indexes, the indexes may be successfully created (or marked as invalid on postgres) while the background update remains unfinished. When Synapse next starts up, the background update will fail because the index already exists, or exists but is invalid on postgres. Use the existing code to create indices in background updates, since it handles these edge cases. Signed-off-by: Sean Quah <seanq@matrix.org>	2022-12-09 23:02:11 +00:00
Patrick Cloke	3ac412b4e2	Require types in tests.storage. (#14646 ) Adds missing type hints to `tests.storage` package and does not allow untyped definitions.	2022-12-09 12:36:32 -05:00
Erik Johnston	94bc21e69f	Limit the number of devices we delete at once (#14649 )	2022-12-09 13:31:32 +00:00
Erik Johnston	c2de2ca630	Delete stale non-e2e devices for users, take 2 (#14595 ) This should help reduce the number of devices e.g. simple bots the repeatedly login rack up. We only delete non-e2e devices as they should be safe to delete, whereas if we delete e2e devices for a user we may accidentally break their ability to receive e2e keys for a message.	2022-12-09 09:37:07 +00:00
reivilibre	cf1059d045	Fix a long-standing bug where the user directory would return 1 more row than requested. (#14631 )	2022-12-07 11:19:43 +00:00
Richard van der Hoff	cb59e08062	Improve logging and opentracing for to-device message handling (#14598 ) A batch of changes intended to make it easier to trace to-device messages through the system. The intention here is that a client can set a property org.matrix.msgid in any to-device message it sends. That ID is then included in any tracing or logging related to the message. (Suggestions as to where this field should be documented welcome. I'm not enthusiastic about speccing it - it's very much an optional extra to help with debugging.) I've also generally improved the data we send to opentracing for these messages.	2022-12-06 09:52:55 +00:00
Erik Johnston	cee9445884	Better return type for `get_all_entities_changed` (#14604 ) Help callers from using the return value incorrectly by ensuring that callers explicitly check if there was a cache hit or not.	2022-12-05 15:19:14 -05:00
reivilibre	501f62d1a6	Faster remote room joins: stream the un-partial-stating of rooms over replication. [rei:frrj/streams/unpsr] (#14473 )	2022-12-05 13:07:55 +00:00
Patrick Cloke	fac8a38525	Properly handle unknown results for the stream change cache. (#14592 ) StreamChangeCache.get_all_changed_entities can return None to signify it does not have information at the given stream position. Two callers (related to device lists and presence) were treating this response the same as an empty list (i.e. there being no updates).	2022-12-02 10:28:41 -05:00
David Robertson	781b14ec69	Merge branch 'release-v1.73' into develop	2022-12-01 13:43:30 +00:00
Nick Mills-Barrett	e8bce8999f	Aggregate unread notif count query for badge count calculation (#14255 ) Fetch the unread notification counts used by the badge counts in push notifications for all rooms at once (instead of fetching them per room).	2022-11-30 08:45:06 -05:00
David Robertson	c29e2c6306	Revert "POC delete stale non-e2e devices for users (#14038 )" (#14582 )	2022-11-29 17:48:48 +00:00
David Robertson	e860316818	Fix `UndefinedColumn: column "key_json" does not exist` errors when handling users with more than 50 non-E2E devices (#14580 )	2022-11-29 13:05:07 +00:00
Erik Johnston	c7e29ca277	POC delete stale non-e2e devices for users (#14038 ) This should help reduce the number of devices e.g. simple bots the repeatedly login rack up. We only delete non-e2e devices as they should be safe to delete, whereas if we delete e2e devices for a user we may accidentally break their ability to receive e2e keys for a message. Co-authored-by: Patrick Cloke <clokep@users.noreply.github.com> Co-authored-by: Sean Quah <8349537+squahtx@users.noreply.github.com>	2022-11-29 10:36:41 +00:00
Travis Ralston	9ccc09fe9e	Support MSC1767's `content.body` behaviour; Add base rules from MSC3933 (#14524 ) * Support MSC1767's `content.body` behaviour in push rules * Add the base rules from MSC3933 * Changelog entry * Flip condition around for finding `m.markup` * Remove forgotten import	2022-11-28 18:02:41 -07:00
Andrew Ferrazzutti	1183c372fa	Use `device_one_time_keys_count` to match MSC3202 (#14565 ) * Use `device_one_time_keys_count` to match MSC3202 Rename the `device_one_time_key_counts` key in responses to `device_one_time_keys_count` to match the name specified by MSC3202. Also change related variable/class names for consistency. Signed-off-by: Andrew Ferrazzutti <andrewf@element.io> * Update changelog.d/14565.misc * Revert name change for `one_time_key_counts` key as this is a different key altogether from `device_one_time_keys_count`, which is used for `/sync` instead of appservice transactions. Signed-off-by: Andrew Ferrazzutti <andrewf@element.io>	2022-11-28 16:17:29 +00:00
Sean Quah	f792dd74e1	Remove option to skip locking of tables during emulated upserts (#14469 ) To perform an emulated upsert into a table safely, we must either: * lock the table, * be the only writer upserting into the table * or rely on another unique index being present. When the 2nd or 3rd cases were applicable, we previously avoided locking the table as an optimization. However, as seen in #14406, it is easy to slip up when adding new schema deltas and corrupt the database. The only time we lock when performing emulated upserts is while waiting for background updates on postgres. On sqlite, we do no locking at all. Let's remove the option to skip locking tables, so that we don't shoot ourselves in the foot again. Signed-off-by: Sean Quah <seanq@matrix.org>	2022-11-28 13:42:06 +00:00
schmop	c2e06c36d4	Fix crash admin media list api when info is None (#14537 ) Fixes https://github.com/matrix-org/synapse/issues/14536	2022-11-24 10:49:04 +00:00
Erik Johnston	f38d7d79c8	Add another index to `device_lists_changes_in_room` (#14534 ) This helps avoid reading unnecessarily large amounts of data from the table when querying with a set of room IDs.	2022-11-23 14:09:00 +00:00
Sean Quah	9cae44f49e	Track unconverted device list outbound pokes using a position instead (#14516 ) When a local device list change is added to `device_lists_changes_in_room`, the `converted_to_destinations` flag is set to `FALSE` and the `_handle_new_device_update_async` background process is started. This background process looks for unconverted rows in `device_lists_changes_in_room`, copies them to `device_lists_outbound_pokes` and updates the flag. To update the `converted_to_destinations` flag, the database performs a `DELETE` and `INSERT` internally, which fragments the table. To avoid this, track unconverted rows using a `(stream ID, room ID)` position instead of the flag. From now on, the `converted_to_destinations` column indicates rows that need converting to outbound pokes, but does not indicate whether the conversion has already taken place. Closes #14037. Signed-off-by: Sean Quah <seanq@matrix.org>	2022-11-22 16:46:52 +00:00
Patrick Cloke	6d7523ef14	Batch fetch bundled references (#14508 ) Avoid an n+1 query problem and fetch the bundled aggregations for m.reference relations in a single query instead of a query per event. This applies similar logic for as was previously done for edits in `8b309adb43` (#11660; threads in `b65acead42` (#11752); and annotations in `1799a54a54` (#14491).	2022-11-22 09:41:09 -05:00
Patrick Cloke	1799a54a54	Batch fetch bundled annotations (#14491 ) Avoid an n+1 query problem and fetch the bundled aggregations for m.annotation relations in a single query instead of a query per event. This applies similar logic for as was previously done for edits in `8b309adb43` (#11660) and threads in `b65acead42` (#11752).	2022-11-22 07:26:11 -05:00
David Robertson	115f0eb233	Reintroduce #14376 , with bugfix for monoliths (#14468 ) * Add tests for StreamIdGenerator * Drive-by: annotate all defs * Revert "Revert "Remove slaved id tracker (#14376)" (#14463)" This reverts commit `d63814fd73`, which in turn reverted `36097e88c4`. This restores the latter. * Fix StreamIdGenerator not handling unpersisted IDs Spotted by @erikjohnston. Closes #14456. * Changelog Co-authored-by: Nick Mills-Barrett <nick@fizzadar.com> Co-authored-by: Erik Johnston <erik@matrix.org>	2022-11-16 22:16:46 +00:00
Patrick Cloke	d8cc86eff4	Remove redundant types from comments. (#14412 ) Remove type hints from comments which have been added as Python type hints. This helps avoid drift between comments and reality, as well as removing redundant information. Also adds some missing type hints which were simple to fill in.	2022-11-16 15:25:24 +00:00
Sean Quah	882277008c	Fix background updates failing to add unique indexes on receipts (#14453 ) As part of the database migration to support threaded receipts, there is a possible window in between `73/08thread_receipts_non_null.sql.postgres` removing the original unique constraints on `receipts_linearized` and `receipts_graph` and the `reeipts_linearized_unique_index` and `receipts_graph_unique_index` background updates from `72/08thread_receipts.sql` completing where the unique constraints on `receipts_linearized` and `receipts_graph` are missing. Any emulated upserts on these tables must therefore be performed with a lock held, otherwise duplicate rows can end up in the tables when there are concurrent emulated upserts. Fix the missing lock. Note that emulated upserts no longer happen by default on sqlite, since the minimum supported version of sqlite supports native upserts by default now. Finally, clean up any duplicate receipts that may have crept in before trying to create the `receipts_graph_unique_index` and `receipts_linearized_unique_index` unique indexes. Signed-off-by: Sean Quah <seanq@matrix.org>	2022-11-16 15:01:22 +00:00
Erik Johnston	d63814fd73	Revert "Remove slaved id tracker (#14376 )" (#14463 ) This reverts commit `36097e88c4`.	2022-11-16 13:50:07 +00:00
David Robertson	1eed795fc5	Include heroes in partial join responses' state (#14442 ) * Pull out hero selection logic * Include heroes in partial join response's state * Changelog * Fixup trial test * Remove TODO	2022-11-15 17:35:19 +00:00
reivilibre	634359b083	Update docstring to clarify that `get_partial_state_events_batch` does not just give you completely arbitrary partial-state events. (#14417 )	2022-11-15 10:43:17 +00:00
Nick Mills-Barrett	36097e88c4	Remove slaved id tracker (#14376 ) This matches the multi instance writer ID generator class which can both handle advancing the current token over replication and by calling the database.	2022-11-14 17:31:36 +00:00
Patrick Cloke	fb66fae84b	Clean-up events persistance code (#14411 ) By removing unused variables and making some arguments required which are always provided.	2022-11-14 08:13:11 -05:00
Nick Mills-Barrett	3a4f80f8c6	Merge/remove `Slaved*` stores into `WorkerStores` (#14375 )	2022-11-11 10:51:49 +00:00
Patrick Cloke	e9a4343cb2	Drop support for Postgres 10 in full text search code. (#14397 )	2022-11-09 09:55:34 -05:00
Richard van der Hoff	2193513346	Fix background update table-scanning `events` (#14374 ) When this background update did its last batch, it would try to update all the events that had been inserted since the bgupdate started, which could cause a table-scan. Make sure we limit the update correctly.	2022-11-07 14:28:00 +00:00
Brendan Abolivier	86c5a710d8	Implement MSC3912: Relation-based redactions (#14260 ) Co-authored-by: Sean Quah <8349537+squahtx@users.noreply.github.com>	2022-11-03 16:21:31 +00:00
Quentin Gliech	cc3a52b33d	Support OIDC backchannel logouts (#11414 ) If configured an OIDC IdP can log a user's session out of Synapse when they log out of the identity provider. The IdP sends a request directly to Synapse (and must be configured with an endpoint) when a user logs out.	2022-10-31 13:07:30 -04:00
Andrew Morgan	7911e2835d	Prevent federation user keys query from returning device names if disallowed (#14304 )	2022-10-28 18:06:02 +01:00
Patrick Cloke	81815e0561	Switch search SQL to triple-quote strings. (#14311 ) For ease of reading we switch from concatenated strings to triple quote strings.	2022-10-28 11:44:10 -04:00
Eric Eastwood	aa70556699	Check appservice user interest against the local users instead of all users (`get_users_in_room` mis-use) (#13958 )	2022-10-27 18:29:23 +00:00
Patrick Cloke	67583281e3	Fix tests for change in PostgreSQL 14 behavior change. (#14310 ) PostgreSQL 14 changed the behavior of `websearch_to_tsquery` to improve some behaviour. The tests were hitting those edge-cases about handling of hanging double quotes. This fixes the tests to take into account the PostgreSQL version.	2022-10-27 13:58:12 +00:00
Mathieu Velten	4dc05f3019	Fix presence bug introduced in 1.64 by #13313 (#14243 ) * Fix presence bug introduced in 1.64 by #13313 Signed-off-by: Mathieu Velten <mathieuv@matrix.org> * Add changelog * Add DISTINCT * Apply suggestions from code review Signed-off-by: Mathieu Velten <mathieuv@matrix.org>	2022-10-27 13:16:00 +01:00
Quentin Gliech	8756d5c87e	Save login tokens in database (#13844 ) * Save login tokens in database Signed-off-by: Quentin Gliech <quenting@element.io> * Add upgrade notes * Track login token reuse in a Prometheus metric Signed-off-by: Quentin Gliech <quenting@element.io>	2022-10-26 11:45:41 +01:00
James Salter	d902181de9	Unified search query syntax using the full-text search capabilities of the underlying DB. (#11635 ) Support a unified search query syntax which leverages more of the full-text search of each database supported by Synapse. Supports, with the same syntax across Postgresql 11+ and Sqlite: - quoted "search terms" - `AND`, `OR`, `-` (negation) operators - Matching words based on their stem, e.g. searches for "dog" matches documents containing "dogs". This is achieved by - If on postgresql 11+, pass the user input to `websearch_to_tsquery` - If on sqlite, manually parse the query and transform it into the sqlite-specific query syntax. Note that postgresql 10, which is close to end-of-life, falls back to using `phraseto_tsquery`, which only supports a subset of the features. Multiple terms separated by a space are implicitly ANDed. Note that: 1. There is no escaping of full-text syntax that might be supported by the database; e.g. `NOT`, `NEAR`, `*` in sqlite. This runs the risk that people might discover this as accidental functionality and depend on something we don't guarantee. 2. English text is assumed for stemming. To support other languages, either the target language needs to be known at the time of indexing the message (via room metadata, or otherwise), or a separate index for each language supported could be created. Sqlite docs: https://www.sqlite.org/fts3.html#full_text_index_queries Postgres docs: https://www.postgresql.org/docs/11/textsearch-controls.html	2022-10-25 14:05:22 -04:00
Olivier Wilkinson (reivilibre)	85fcbba595	Merge branch 'release-v1.70' into develop	2022-10-25 15:39:35 +01:00
DeepBlueV7.X	2d0ba3f89a	Implementation for MSC3664: Pushrules for relations (#11804 )	2022-10-25 14:38:01 +01:00
Patrick Cloke	581b37b5d6	Revert behavior change for bundling edits of non-message events (#14283 )	2022-10-24 17:07:16 +01:00
Richard van der Hoff	1469fed0e3	Add debugging to help diagnose lost device-list-update (#14268 )	2022-10-24 10:45:10 +01:00
Patrick Cloke	4dd7aa371b	Properly update the threads table when thread events are redacted. (#14248 ) When the last event in a thread is redacted we need to update the threads table: * Find the new latest event in the thread and store it into the table; or * Remove the thread from the table if it is no longer a thread (i.e. all events in the thread were redacted).	2022-10-21 09:11:19 -04:00
Tadeusz Sośnierz	1433b5d5b6	Show erasure status when listing users in the Admin API (#14205 ) * Show erasure status when listing users in the Admin API * Use USING when joining erased_users * Add changelog entry * Revert "Use USING when joining erased_users" This reverts commit 30bd2bf106415caadcfdbdd1b234ef2b106cc394. * Make the erased check work on postgres * Add a testcase for showing erased user status * Appease the style linter * Explicitly convert `erased` to bool to make SQLite consistent with Postgres This also adds us an easy way in to fix the other accidentally integered columns. * Move erasure status test to UsersListTestCase * Include user erased status when fetching user info via the admin API * Document the erase status in user_admin_api * Appease the linter and mypy * Signpost comments in tests Co-authored-by: Tadeusz Sośnierz <tadeusz@sosnierz.com> Co-authored-by: David Robertson <david.m.robertson1@gmail.com>	2022-10-21 13:52:44 +01:00
dependabot[bot]	0b7830e457	Bump flake8-bugbear from 21.3.2 to 22.9.23 (#14042 ) Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Erik Johnston <erik@matrix.org> Co-authored-by: David Robertson <davidr@element.io>	2022-10-19 19:38:24 +00:00
Eric Eastwood	fa8616e65c	Fix MSC3030 `/timestamp_to_event` returning `outliers` that it has no idea whether are near a gap or not (#14215 ) Fix MSC3030 `/timestamp_to_event` endpoint returning `outliers` that it has no idea whether are near a gap or not (and therefore unable to determine whether it's actually the closest event). The reason Synapse doesn't know whether an `outlier` is next to a gap is because our gap checks rely on entries in the `event_edges`, `event_forward_extremeties`, and `event_backward_extremities` tables which is [not the case for `outliers`](`2c63cdcc3f/docs/development/room-dag-concepts.md (outliers)`). Also fixes MSC3030 Complement `can_paginate_after_getting_remote_event_from_timestamp_to_event_endpoint` test flake. Although this acted flakey in Complement, if `sync_partial_state` raced and beat us before `/timestamp_to_event`, then even if we retried the failing `/context` request it wouldn't work until we made this Synapse change. With this PR, Synapse will never return an `outlier` event so that test will always go and ask over federation. Fix https://github.com/matrix-org/synapse/issues/13944 ### Why did this fail before? Why was it flakey? Sleuthing the server logs on the [CI failure](https://github.com/matrix-org/synapse/actions/runs/3149623842/jobs/5121449357#step:5:5805), it looks like `hs2:/timestamp_to_event` found `$NP6-oU7mIFVyhtKfGvfrEQX949hQX-T-gvuauG6eurU` as an `outlier` event locally. Then when we went and asked for it via `/context`, since it's an `outlier`, it was filtered out of the results -> `You don't have permission to access that event.` This is reproducible when `sync_partial_state` races and persists `$NP6-oU7mIFVyhtKfGvfrEQX949hQX-T-gvuauG6eurU` as an `outlier` before we evaluate `get_event_for_timestamp(...)`. To consistently reproduce locally, just add a delay at the [start of `get_event_for_timestamp(...)`](`cb20b885cb/synapse/handlers/room.py (L1470-L1496)`) so it always runs after `sync_partial_state` completes. ```py from twisted.internet import task as twisted_task d = twisted_task.deferLater(self.hs.get_reactor(), 3.5) await d ``` In a run where it passes, on `hs2`, `get_event_for_timestamp(...)` finds a different event locally which is next to a gap and we request from a closer one from `hs1` which gets backfilled. And since the backfilled event is not an `outlier`, it's returned as expected during `/context`. With this PR, Synapse will never return an `outlier` event so that test will always go and ask over federation.	2022-10-18 19:46:25 -05:00

1 2 3 4 5 ...

863 Commits