forked-synapse

mirror of https://mau.dev/maunium/synapse.git synced 2024-10-01 01:36:05 -04:00

Author	SHA1	Message	Date
Shay	9edb725ebc	Support MSC3916 by adding unstable media endpoints to `_matrix/client` (#17213 ) [MSC3916](https://github.com/matrix-org/matrix-spec-proposals/blob/rav/authentication-for-media/proposals/3916-authentication-for-media.md) adds new media endpoints under `_matrix/client`. This PR adds the `/preview_url`, `/config`, and `/thumbnail` endpoints. `/download` will be added in a follow-up PR once the work for the federation `/download` endpoint is complete (see https://github.com/element-hq/synapse/pull/17172). Should be reviewable commit-by-commit.	2024-05-24 09:47:37 +01:00
Eric Eastwood	c97251d5ba	Add Sliding Sync `/sync/e2ee` endpoint for To-Device messages (#17167 ) This is being introduced as part of Sliding Sync but doesn't have any sliding window component. It's just a way to get E2EE events without having to sit through a big initial sync (`/sync` v2). And we can avoid encryption events being backed up by the main sync response or vice-versa. Part of some Sliding Sync simplification/experimentation. See [this discussion](https://github.com/element-hq/synapse/pull/17167#discussion_r1610495866) for why it may not be as useful as we thought. Based on: - https://github.com/matrix-org/matrix-spec-proposals/pull/3575 - https://github.com/matrix-org/matrix-spec-proposals/pull/3885 - https://github.com/matrix-org/matrix-spec-proposals/pull/3884	2024-05-23 12:06:16 -05:00
reivilibre	7e2412265d	Log exceptions when failing to auto-join new user according to the `auto_join_rooms` option. (#17176 ) Would have been useful for tracking down #16878. Signed-off-by: Olivier 'reivilibre <oliverw@matrix.org>	2024-05-22 14:22:33 +01:00
reivilibre	7ef00b7628	Add logging to tasks managed by the task scheduler, showing CPU and database usage. (#17219 ) The log format is the same as the request log format, except: - fields that are specific to HTTP requests have been removed - the task's params are included at the end of the log line. These log lines are emitted: - when the task function finishes — both completion and failure (and I suppose it is possible for a task to become schedulable again?) - every 5 minutes whilst it is running Closes #17217. --------- Signed-off-by: Olivier 'reivilibre <oliverw@matrix.org>	2024-05-22 14:12:58 +01:00
Erik Johnston	b71d277438	Reduce work of calculating outbound device pokes (#17211 )	2024-05-22 13:55:18 +01:00
devonh	6a9a641fb8	Bring auto-accept invite logic into Synapse (#17147 ) This PR ports the logic from the [synapse_auto_accept_invite](https://github.com/matrix-org/synapse-auto-accept-invite) module into synapse. I went with the naive approach of injecting the "module" next to where third party modules are currently loaded. If there is a better/preferred way to handle this, I'm all ears. It wasn't obvious to me if there was a better location to add this logic that would cleanly apply to all incoming invite events. Relies on https://github.com/element-hq/synapse/pull/17166 to fix linter errors.	2024-05-21 20:09:17 +00:00
Erik Johnston	b5facbac0f	Improve perf of sync device lists (#17216 ) Re-introduces #17191, and includes #17197 and #17214 The basic idea is to stop calling `get_rooms_for_user` everywhere, and instead use the table `device_lists_changes_in_room`. Commits reviewable one-by-one.	2024-05-21 16:48:20 +01:00
Erik Johnston	52af16c561	Add a short sleep if the request is rate-limited (#17210 ) This helps prevent clients from "tight-looping" retrying their request.	2024-05-18 12:03:30 +01:00
Eric Eastwood	c856ae4724	Refactor `SyncResultBuilder` assembly to its own function (#17202 ) We will re-use `get_sync_result_builder(...)` in https://github.com/element-hq/synapse/pull/17167 Split out from https://github.com/element-hq/synapse/pull/17167	2024-05-16 13:05:31 -05:00
Eric Eastwood	fe07995e69	Fix `joined_rooms`/`joined_room_ids` usage (#17208 ) This change was introduced in https://github.com/element-hq/synapse/pull/17203 But then https://github.com/element-hq/synapse/pull/17207 was reverted which brought back usage `joined_rooms` that needed to be updated. Wasn't caught because `develop` wasn't up to date before merging.	2024-05-16 17:27:38 +00:00
Eric Eastwood	52a649580f	Rename to be obvious: `joined_rooms` -> `joined_room_ids` (#17203 ) Split out from https://github.com/element-hq/synapse/pull/17167	2024-05-16 11:55:51 -05:00
Eric Eastwood	28a948f04f	Removed `request_key` from the `SyncConfig` (moved outside as its own function parameter) (#17201 ) Removed `request_key` from the `SyncConfig` (moved outside as its own function parameter) so it doesn't have to flow into `_generate_sync_entry_for_xxx` methods. This way we can separate the concerns of caching from generating the response and reuse the `_generate_sync_entry_for_xxx` functions as we see fit. Plus caching doesn't really have anything to do with the config of sync. Split from https://github.com/element-hq/synapse/pull/17167 Spawning from https://github.com/element-hq/synapse/pull/17167#discussion_r1601497279	2024-05-16 11:54:46 -05:00
Erik Johnston	fd12003441	Revert "Improve perf of sync device lists" (#17207 ) Reverts element-hq/synapse#17191	2024-05-16 16:07:54 +01:00
Erik Johnston	5e892671a7	Fix bug where push rules would be empty in `/sync` (#17142 ) Fixes #16987 Some old accounts seem to have an entry in global account data table for push rules, which we should ignore	2024-05-16 15:04:14 +01:00
Eric Eastwood	d2d48cce85	Refactor Sync handler to be able to return different sync responses (`SyncVersion`) (#17200 ) Refactor Sync handler to be able to be able to return different sync responses (`SyncVersion`). Preparation to be able support sync v2 and a new Sliding Sync `/sync/e2ee` endpoint which returns a subset of sync v2. Split upon request: https://github.com/element-hq/synapse/pull/17167#discussion_r1601497279 Split from https://github.com/element-hq/synapse/pull/17167 where we will add `SyncVersion.E2EE_SYNC` and a new type of sync response.	2024-05-16 11:36:54 +01:00
Erik Johnston	284d85dee3	Cache literal sync filter validation (#17186 ) The sliding sync proxy (amongst other things) use literal json blobs as filters, and repeatedly validating them takes a bunch of CPU.	2024-05-14 15:08:46 +01:00
Erik Johnston	ebe77381b0	Reduce pauses on large device list changes (#17192 ) For large accounts waking up all the relevant notifier streams can cause pauses of the reactor.	2024-05-14 14:39:11 +01:00
Erik Johnston	0b91ccce47	Improve perf of sync device lists (#17191 ) It's almost always more efficient to query the rooms that have device list changes, rather than looking at the list of all users whose devices have changed and then look for shared rooms.	2024-05-14 14:39:04 +01:00
Aurélien Grimpard	7d82987b27	Allows CAS SSO flow to provide user IDs composed of numbers only (#17098 )	2024-05-14 13:55:32 +01:00
Erik Johnston	038b9ec59a	An federation whitelist query endpoint extension (#16848 ) This is to allow clients to query the configured federation whitelist. Disabled by default. --------- Co-authored-by: Devon Hudson <devonhudson@librem.one> Co-authored-by: devonh <devon.dmytro@gmail.com> Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com>	2024-05-13 19:38:45 +00:00
Erik Johnston	59ac541310	Actually fix public rooms (#17184 ) See #17177. I'm an idiot and moved them to the wrong store 🤦	2024-05-13 13:11:07 +01:00
Erik Johnston	a2e6f43f11	Fix bug with creating public rooms on workers (#17177 ) If room publication is disabled then creating public rooms on workers would not work. Introduced in #16811.	2024-05-13 12:12:26 +01:00
devonh	393429d692	Fix undiscovered linter errors (#17166 ) Linter errors are showing up in #17147 that are unrelated to that PR. The errors do not currently show up on develop. This PR aims to resolve the linter errors separately from #17147.	2024-05-08 14:57:32 +00:00
Timshel	34a8652366	Optional whitespace support in Authorization (#1350 ) (#17145 ) Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com>	2024-05-08 13:56:16 +00:00
Hugh Nimmo-Smith	212f150208	Add note about MSC3886 being closed (#17151 )	2024-05-08 12:49:32 +01:00
Erik Johnston	3e6ee8ff88	Add optimisation to `StreamChangeCache` (#17130 ) When there have been lots of changes compared with the number of entities, we can do a fast(er) path. Locally I ran some benchmarking, and the comparison seems to give the best determination of which method we use.	2024-05-06 12:56:52 +01:00
Erik Johnston	7c9ac01eb5	Fix bug where `StreamChangeCache` would not respect cache factors (#17152 ) Annoyingly mypy didn't pick up this typo.	2024-05-03 18:00:08 +01:00
Shay	37558d5e4c	Add support for MSC3823 - Account Suspension (#17051 )	2024-05-01 17:45:17 +01:00
devonh	7ab0f630da	Apply user `email` & `picture` during OIDC registration if present & selected (#17120 ) This change will apply the `email` & `picture` provided by OIDC to the new user account when registering a new user via OIDC. If the user is directed to the account details form, this change makes sure they have been selected before applying them, otherwise they are omitted. In particular, this change ensures the values are carried through when Synapse has consent configured, and the redirect to the consent form/s are followed. I have tested everything manually. Including: - with/without consent configured - allowing/not allowing the use of email/avatar (via `sso_auth_account_details.html`) - with/without automatic account detail population (by un/commenting the `localpart_template` option in synapse config). ### Pull Request Checklist <!-- Please read https://element-hq.github.io/synapse/latest/development/contributing_guide.html before submitting your pull request --> * [X] Pull request is based on the develop branch * [X] Pull request includes a [changelog file](https://element-hq.github.io/synapse/latest/development/contributing_guide.html#changelog). The entry should: - Be a short description of your change which makes sense to users. "Fixed a bug that prevented receiving messages from other servers." instead of "Moved X method from `EventStore` to `EventWorkerStore`.". - Use markdown where necessary, mostly for `code blocks`. - End with either a period (.) or an exclamation mark (!). - Start with a capital letter. - Feel free to credit yourself, by adding a sentence "Contributed by @github_username." or "Contributed by [Your Name]." to the end of the entry. * [X] [Code style](https://element-hq.github.io/synapse/latest/code_style.html) is correct (run the [linters](https://element-hq.github.io/synapse/latest/development/contributing_guide.html#run-the-linters))	2024-04-29 15:23:05 +00:00
Richard van der Hoff	b548f7803a	Add support for MSC4115 (#17104 ) Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com>	2024-04-29 15:22:13 +01:00
Richard van der Hoff	c897ac63e9	Ensure that incoming to-device messages are not dropped (#17127 ) ... when workers are unreachable, etc. Fixes https://github.com/element-hq/synapse/issues/17117. The general principle is just to make sure that we propagate any exceptions to the JsonResource, so that we return an error code to the sending server. That means that the sending server no longer considers the message safely sent, so it will retry later. In the issue, Erik mentions that an alternative solution would be to persist the to-device messages into a table so that they can be retried. This might be an improvement for performance, but even if we did that, we still need this mechanism, since we might be unable to reach the database. So, if we want to do that, it can be a later follow-up. --------- Co-authored-by: Erik Johnston <erik@matrix.org>	2024-04-29 14:11:00 +01:00
Patrick Cloke	38bc7a009d	Declare support for Matrix v1.10. (#17082 ) Pretty straightforward. 😄 Fixes #17021	2024-04-29 14:09:03 +01:00
Andrew Morgan	89fc579329	Fix filtering of rooms when supplying the `destination` query parameter to `/_synapse/admin/v1/federation/destinations/<destination>/rooms` (#17077 )	2024-04-26 10:52:24 +01:00
Michael Telatynski	41fbe387d6	Improve error message for cross signing reset with MSC3861 enabled (#17121 )	2024-04-26 09:54:30 +01:00
Andrew Ferrazzutti	516fd891ee	Use recommended endpoint for MSC3266 requests (#17078 ) Keep the existing endpoint for backwards compatibility Signed-off-by: Andrew Ferrazzutti <andrewf@element.io>	2024-04-26 09:46:42 +01:00
Melvyn Laïly	59710437e4	Return the search terms as search highlights for SQLite instead of nothing (#17000 ) Fixes https://github.com/element-hq/synapse/issues/16999 and https://github.com/element-hq/element-android/pull/8729 by returning the search terms as search highlights.	2024-04-26 09:43:52 +01:00
Till	47773232b0	Redact membership events if the user requested erasure upon deactivating (#17076 ) Fixes #15355 by redacting all membership events before leaving rooms.	2024-04-25 14:25:31 +01:00
Quentin Gliech	2e92b718d5	MSC4108 implementation (#17056 ) Co-authored-by: Hugh Nimmo-Smith <hughns@element.io> Co-authored-by: Hugh Nimmo-Smith <hughns@users.noreply.github.com> Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com>	2024-04-25 12:50:12 +00:00
Andrew Morgan	646cb6ff24	Add type annotation to `visited_chains` (#17125 ) This should fix CI on `develop`. Broke in `0fe9e1f7da`, presumably due to a `mypy` dependency upgrade.	2024-04-25 12:25:26 +00:00
Erik Johnston	0fe9e1f7da	Merge branch 'master' into develop	2024-04-23 17:06:52 +01:00
mcalinghee	ae181233aa	Send an email if the address is already bound to an user account (#16819 ) Co-authored-by: Mathieu Velten <mathieu.velten@beta.gouv.fr> Co-authored-by: Olivier D <odelcroi@gmail.com>	2024-04-23 16:45:24 +01:00
Erik Johnston	55b0aa847a	Fix GHSA-3h7q-rfh9-xm4v Weakness in auth chain indexing allows DoS from remote room members through disk fill and high CPU usage. A remote Matrix user with malicious intent, sharing a room with Synapse instances before 1.104.1, can dispatch specially crafted events to exploit a weakness in how the auth chain cover index is calculated. This can induce high CPU consumption and accumulate excessive data in the database of such instances, resulting in a denial of service. Servers in private federations, or those that do not federate, are not affected.	2024-04-23 15:25:49 +01:00
Gordan Trevis	1d47532310	Parse json validation (#16923 ) Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com>	2024-04-18 13:57:38 +01:00
Erik Johnston	803f05f60c	Fix remote receipts for events we don't have (#17096 ) Introduced in #17032	2024-04-17 16:08:40 +01:00
Quentin Gliech	c8e0bed426	Support for MSC4108 via delegation (#17086 ) This adds support for MSC4108 via delegation, similar to what has been done for MSC3886 --------- Co-authored-by: Hugh Nimmo-Smith <hughns@element.io>	2024-04-17 16:47:35 +02:00
Gordan Trevis	f0d6f14047	Parse Integer negative value validation (#16920 )	2024-04-16 19:12:36 +00:00
Kegan Dougal	259442fa4c	bugfix: make msc3967 idempotent (#16943 ) MSC3967 was updated recently to make it more robust to network failures: > there is an existing cross-signing master key and it exactly matches the cross-signing master key provided in the request body. If there are any additional keys provided in the request (self signing key, user signing key) they MUST also match the existing keys stored on the server. In other words, the request contains no new keys. If there are new keys, UIA MUST be performed. https://github.com/matrix-org/matrix-spec-proposals/blob/hughns/device-signing-upload-uia/proposals/3967-device-signing-upload-uia.md#proposal This covers the case where the 200 OK is lost in transit so the client retries the upload, only to then get UIA'd. Complement tests: https://github.com/matrix-org/complement/pull/713 - passing example https://github.com/element-hq/synapse/actions/runs/7976948122/job/21778795094?pr=16943#step:7:8820 ### Pull Request Checklist <!-- Please read https://element-hq.github.io/synapse/latest/development/contributing_guide.html before submitting your pull request --> * [x] Pull request is based on the develop branch * [x] Pull request includes a [changelog file](https://element-hq.github.io/synapse/latest/development/contributing_guide.html#changelog). The entry should: - Be a short description of your change which makes sense to users. "Fixed a bug that prevented receiving messages from other servers." instead of "Moved X method from `EventStore` to `EventWorkerStore`.". - Use markdown where necessary, mostly for `code blocks`. - End with either a period (.) or an exclamation mark (!). - Start with a capital letter. - Feel free to credit yourself, by adding a sentence "Contributed by @github_username." or "Contributed by [Your Name]." to the end of the entry. * [x] [Code style](https://element-hq.github.io/synapse/latest/code_style.html) is correct (run the [linters](https://element-hq.github.io/synapse/latest/development/contributing_guide.html#run-the-linters)) --------- Co-authored-by: reivilibre <oliverw@matrix.org>	2024-04-15 10:57:56 +00:00
Nick Mills-Barrett	fe4719a268	Use receipts `event_stream_ordering` instead of joins (#17032 ) Resurrecting https://github.com/matrix-org/synapse/pull/13918. This should reduce IOPs incurred by joining to the events table to lookup stream ordering, which happens in many receipt handling code paths. Like the previous PR I believe sufficient time has passed between the original migration in DB schema 72 and now to merge this as-is. It's highly unlikely that both the migration is still ongoing AND (active) users still have any receipts prior to that date. In the unlikely event there is a receipt without a populated `event_stream_ordering` synapse will behave just as it does now when receipts exist for events that don't (yet): for push action calculation the receipts are just ignored. I've removed the validation on event IDs as this is already covered here: `59ceabcb97/synapse/handlers/receipts.py (L189-L192)`	2024-04-12 09:28:44 +01:00
Erik Johnston	3a30846bd0	Fix mypy on latest Twisted release (#17036 ) `ITransport.abortConnection` isn't a thing, but `HTTPChannel.forceAbortClient` calls it, so lets just use that Fixes https://github.com/element-hq/synapse/issues/16728	2024-04-11 16:03:45 +01:00
Patrick Cloke	657b8cc75c	Stabilize support for MSC4010: push rules & account data. (#17022 ) See [MSC4010](https://github.com/matrix-org/matrix-spec-proposals/pull/4010), but this is pretty much just removing an experimental flag. Part of #17021	2024-04-09 17:11:50 +01:00
Patrick Cloke	a2a543fd12	Stabliize support for MSC3981: recurse /relations (#17023 ) See [MSC3981](https://github.com/matrix-org/matrix-spec-proposals/pull/3981), this pretty much just removes flags though. Part of #17021	2024-04-09 17:11:08 +01:00
Erik Johnston	89f1092284	Also check if first event matches the last in prev batch (#17066 ) Refinement of #17064 cc @richvdh	2024-04-09 14:01:12 +00:00
Mathieu Velten	e363881592	Fix PR #16677 , a parameter was missing in a function call (#17033 ) Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com>	2024-04-09 14:06:46 +01:00
Erik Johnston	d40878451c	Add forgotten schema delta (#17054 ) This should have been in #17045. Whoops.	2024-04-09 13:03:41 +01:00
Erik Johnston	4d10a8fb18	Fixups to #17064 (#17065 ) Forget a line, and an empty batch is trivially linear. c.f. #17064	2024-04-08 14:55:19 +01:00
Erik Johnston	1f8f991d51	Add back fast path for non-gappy syncs (#17064 ) PR #16942 removed an invalid optimisation that avoided pulling out state for non-gappy syncs. This causes a large increase in DB usage. c.f. #16941 for why that optimisation was wrong. However, we can still optimise in the simple case where the events in the timeline are a linear chain without any branching/merging of the DAG. cc. @richvdh	2024-04-08 14:25:28 +01:00
Erik Johnston	5360baeb64	Pull out fewer receipts from DB when doing push (#17049 ) Before we were pulling out all read receipts for a user for every event we pushed. Instead let's only pull out the relevant receipts. This also pulled out the event rows for each receipt, causing load on the events table.	2024-04-05 12:46:34 +01:00
Richard van der Hoff	0e68e9b7f4	Fix bug in calculating state for non-gappy syncs (#16942 ) Unfortunately, the optimisation we applied here for non-gappy syncs is not actually valid. Fixes https://github.com/element-hq/synapse/issues/16941. ~~Based on https://github.com/element-hq/synapse/pull/16930.~~ Requires https://github.com/matrix-org/sytest/pull/1374.	2024-04-04 16:15:35 +00:00
Richard van der Hoff	230b709d9d	`/sync`: fix bug in calculating `state` response (#16930 ) Fix a long-standing issue which could cause state to be omitted from the sync response if the last event was filtered out. Fixes: https://github.com/element-hq/synapse/issues/16928	2024-04-04 12:14:24 +00:00
Richard van der Hoff	05957ac70f	Fix bug in `/sync` response for archived rooms (#16932 ) This PR fixes a very, very niche edge-case, but I've got some more work coming which will otherwise make the problem worse. The bug happens when the syncing user leaves a room, and has a sync filter which includes "left" rooms, but sets the timeline limit to 0. In that case, the state returned in the `state` section is calculated incorrectly. The fix is to pass a token corresponding to the point that the user leaves the room through to `compute_state_delta`.	2024-04-04 12:47:59 +01:00
Erik Johnston	31122b71bc	Add missing index to `access_tokens` table (#17045 ) This was causing sequential scans when using refresh tokens.	2024-04-04 11:05:40 +01:00
Erik Johnston	ec174d0470	Refactor chain fetching (#17044 ) Since these queries are duplicated in two places.	2024-04-02 15:33:56 +01:00
Erik Johnston	fd48fc4585	Fixups to new push stream (#17038 ) Follow on from #17037	2024-03-28 16:29:23 +00:00
Erik Johnston	ea6bfae0fc	Add support for moving `/push_rules` off of main process (#17037 )	2024-03-28 15:44:07 +00:00
Erik Johnston	c900d18647	Fix OIDC login regression (#17031 ) Requests may require a User-Agent header, and the change in #16972 accidentally removed it, resulting in requests getting rejected causing login to fail.	2024-03-26 13:26:46 +00:00
Richard van der Hoff	b5322b4daf	Ensure that pending to-device events are sent over federation at startup (#16925 ) Fixes https://github.com/element-hq/synapse/issues/16680, as well as a related bug, where servers which we had never successfully sent an event to would not be retried. In order to fix the case of pending to-device messages, we hook into the existing `wake_destinations_needing_catchup` process, by extending it to look for destinations that have pending to-device messages. The federation transmission loop then attempts to send the pending to-device messages as normal.	2024-03-22 13:24:11 +00:00
Mathieu Velten	b7af076ab5	Add OIDC config to add extra parameters to the authorize URL (#16971 )	2024-03-22 10:35:11 +00:00
SpiritCroc	9ad49e7ecf	Do not refuse to set read_marker if previous event_id is in wrong room (#16990 )	2024-03-21 18:43:07 +00:00
Hanadi	f7a3ebe44d	Fix reject knocks on deactivating account (#17010 )	2024-03-21 18:05:54 +00:00
Mathieu Velten	3ab9e6d524	OIDC: try to JWT decode userinfo response if JSON parsing failed (#16972 )	2024-03-21 17:49:44 +00:00
Shay	cf5adc80e1	Update power level default for public rooms (#16907 )	2024-03-19 17:55:31 +00:00
Shay	8fb5b0f335	Improve event validation (#16908 ) As the title states.	2024-03-19 17:52:53 +00:00
Mathieu Velten	74ab329eaa	Pass module API to OIDC mapping provider (#16974 ) As done for SAML mapping provider, let's pass the module API to the OIDC one so the mapper can do more logic in its code.	2024-03-19 17:20:10 +00:00
Richard van der Hoff	9635822cc1	Clarify docs for some room state functions (#16950 ) State before an event is different to state after that event, and people tend to assume the wrong one.	2024-03-19 17:16:37 +00:00
Richard van der Hoff	52f456a822	`/sync`: Fix edge-case in calculating the "device_lists" response (#16949 ) Fixes https://github.com/element-hq/synapse/issues/16948. If the `join` and the `leave` are in the same sync response, we need to count them as a "left" user.	2024-03-14 17:34:19 +00:00
Richard van der Hoff	6d5bafb2c8	Split up `SyncHandler.compute_state_delta` (#16929 ) This is a huge method, which melts my brain. This is a non-functional change which lays some groundwork for future work in this area.	2024-03-14 17:18:48 +00:00
Mathieu Velten	cb562d73aa	Improve lock performance when a lot of locks are waiting (#16840 ) When a lot of locks are waiting for a single lock, notifying all locks independently with `call_later` on each release is really costly and incurs some kind of async contention, where the CPU is spinning a lot for not much. The included test is taking around 30s before the change, and 0.5s after. It was found following failing tests with https://github.com/element-hq/synapse/pull/16827.	2024-03-14 13:49:54 +00:00
dependabot[bot]	9b5eef95ad	Bump ruff from 0.1.14 to 0.3.2 (#16994 )	2024-03-13 17:06:23 +00:00
dependabot[bot]	e161103b46	Bump mypy from 1.5.1 to 1.8.0 (#16901 )	2024-03-13 17:05:57 +00:00
dependabot[bot]	1e68b56a62	Bump black from 23.10.1 to 24.2.0 (#16936 )	2024-03-13 16:46:44 +00:00
Gerrit Gogel	1f88790764	Prevent locking up while processing batched_auth_events (#16968 ) This PR aims to fix #16895, caused by a regression in #7 and not fixed by #16903. The PR #16903 only fixes a starvation issue, where the CPU isn't released. There is a second issue, where the execution is blocked. This theory is supported by the flame graphs provided in #16895 and the fact that I see the CPU usage reducing and far below the limit. Since the changes in #7, the method `check_state_independent_auth_rules` is called with the additional parameter `batched_auth_events`: `6fa13b4f92/synapse/handlers/federation_event.py (L1741-L1743)` It makes the execution enter this if clause, introduced with #15195 `6fa13b4f92/synapse/event_auth.py (L178-L189)` There are two issues in the above code snippet. First, there is the blocking issue. I'm not entirely sure if this is a deadlock, starvation, or something different. In the beginning, I thought the copy operation was responsible. It wasn't. Then I investigated the nested `store.get_events` inside the function `update`. This was also not causing the blocking issue. Only when I replaced the set difference operation (`-` ) with a list comprehension, the blocking was resolved. Creating and comparing sets with a very large amount of events seems to be problematic. This is how the flamegraph looks now while persisting outliers. As you can see, the execution no longer locks up in the above function. ![output_2024-02-28_13-59-40](https://github.com/element-hq/synapse/assets/13143850/6db9c9ac-484f-47d0-bdde-70abfbd773ec) Second, the copying here doesn't serve any purpose, because only a shallow copy is created. This means the same objects from the original dict are referenced. This fails the intention of protecting these objects from mutation. The review of the original PR https://github.com/matrix-org/synapse/pull/15195 had an extensive discussion about this matter. Various approaches to copying the auth_events were attempted: 1) Implementing a deepcopy caused issues due to builtins.EventInternalMetadata not being pickleable. 2) Creating a dict with new objects akin to a deepcopy. 3) Creating a dict with new objects containing only necessary attributes. Concluding, there is no easy way to create an actual copy of the objects. Opting for a deepcopy can significantly strain memory and CPU resources, making it an inefficient choice. I don't see why the copy is necessary in the first place. Therefore I'm proposing to remove it altogether. After these changes, I was able to successfully join these rooms, without the main worker locking up: - #synapse:matrix.org - #element-android:matrix.org - #element-web:matrix.org - #ecips:matrix.org - #ipfs-chatter:ipfs.io - #python:matrix.org - #matrix:matrix.org	2024-03-12 15:07:36 +00:00
Alexander Fechler	48f59d3806	deactivated flag refactored to filter deactivated users. (#16874 ) Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com>	2024-03-11 16:08:04 +00:00
Patrick Cloke	696cc9e802	Stabilize support for Retry-After header (MSC4014) (#16947 )	2024-03-08 09:33:46 +00:00
Quentin Gliech	4af33015af	Fix joining remote rooms when a `on_new_event` callback is registered (#16973 ) Since Synapse 1.76.0, any module which registers a `on_new_event` callback would brick the ability to join remote rooms. This is because this callback tried to get the full state of the room, which would end up in a deadlock. Related: https://github.com/matrix-org/synapse-auto-accept-invite/issues/18 The following module would brick the ability to join remote rooms: ```python from typing import Any, Dict, Literal, Union import logging from synapse.module_api import ModuleApi, EventBase logger = logging.getLogger(__name__) class MyModule: def __init__(self, config: None, api: ModuleApi): self._api = api self._config = config self._api.register_third_party_rules_callbacks( on_new_event=self.on_new_event, ) async def on_new_event(self, event: EventBase, _state_map: Any) -> None: logger.info(f"Received new event: {event}") @staticmethod def parse_config(_config: Dict[str, Any]) -> None: return None ``` This is technically a breaking change, as we are now passing partial state on the `on_new_event` callback. However, this callback was broken for federated rooms since 1.76.0, and local rooms have full state anyway, so it's unlikely that it would change anything.	2024-03-06 16:00:20 +01:00
Andrew Morgan	8a05304222	Revert "Improve DB performance of calculating badge counts for push. (#16756 )" (#16979 )	2024-03-05 12:27:27 +00:00
Erik Johnston	cdbbf3653d	Don't lock up when joining large rooms (#16903 ) Co-authored-by: Andrew Morgan <andrew@amorgan.xyz>	2024-02-20 14:29:18 +00:00
kegsay	c51a2240d1	bugfix: always prefer unthreaded receipt when >1 exist (MSC4102) (#16927 ) Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com>	2024-02-20 14:12:06 +00:00
Remi Rampin	0621e8eb0e	Add metric for emails sent (#16881 ) This adds a counter `synapse_emails_sent_total` for emails sent. They are broken down by `type`, which are `password_reset`, `registration`, `add_threepid`, `notification` (matching the methods of `Mailer`).	2024-02-14 15:30:03 +00:00
Erik Johnston	7b4d7429f8	Don't invalidate the entire event cache when we purge history (#16905 ) We do this by adding support to the LRU cache for "extra indices" based on the cached value. This allows us to efficiently map from room ID to the cached events and only invalidate those.	2024-02-13 13:24:11 +00:00
Erik Johnston	01910b981f	Add a config to not send out device list updates for specific users (#16909 ) List of users not to send out device list updates for when they register new devices. This is useful to handle bot accounts. This is undocumented as its mostly a hack to test on matrix.org. Note: This will still send out device list updates if the device is later updated, e.g. end to end keys are added.	2024-02-13 13:23:03 +00:00
Erik Johnston	ea1b30940e	Merge remote-tracking branch 'origin/release-v1.101' into develop	2024-02-09 10:52:35 +00:00
Erik Johnston	bfa93d1d3b	Only do one concurrent fetch per server in keyring (#16894 ) Otherwise if we've stacked a bunch of requests for the keys of a server, we'll end up sending lots of concurrent requests for its keys, needlessly.	2024-02-09 10:51:11 +00:00
Erik Johnston	02a147039c	Increase batching when fetching auth chains (#16893 ) This basically reverts a change that was in https://github.com/element-hq/synapse/pull/16833, where we reduced the batching. The smaller batching can cause performance issues on busy servers and databases.	2024-02-09 10:51:00 +00:00
David Baker	71ca199165	Accept unprefixed form of MSC3981 recurse parameter (#16842 ) Now that the MSC3981 has passed FCP	2024-02-06 09:48:39 +00:00
dependabot[bot]	871f51c270	Bump lxml-stubs from 0.4.0 to 0.5.1 (#16885 )	2024-02-06 09:29:17 +00:00
Erik Johnston	adf15c4f6b	Run `ANALYZE` after fiddling with stats (#16849 ) Introduced in #16833 Fixes #16844	2024-01-24 13:57:12 +00:00
Erik Johnston	c925b45567	Speed up e2e device keys queries for bot accounts (#16841 ) This helps with bot accounts with lots of non-e2e devices. The change is basically to change the order of the join for the case of using `INNER JOIN`	2024-01-23 11:37:16 +00:00
Erik Johnston	23740eaa3d	Correctly mention previous copyright (#16820 ) During the migration the automated script to update the copyright headers accidentally got rid of some of the existing copyright lines. Reinstate them.	2024-01-23 11:26:48 +00:00
Erik Johnston	14c725f73b	Preparatory work for tweaking performance of auth chain lookups (#16833 )	2024-01-23 11:26:27 +00:00
Shay	a68b48a5dd	Allow room creation but not publishing to continue if room publication rules are violated when creating a new room. (#16811 ) Prior to this PR, if a request to create a public (public as in published to the rooms directory) room violated the room list publication rules set in the [config](https://matrix-org.github.io/synapse/latest/usage/configuration/config_documentation.html#room_list_publication_rules), the request to create the room was denied and the room was not created. This PR changes the behavior such that when a request to create a room published to the directory violates room list publication rules, the room is still created but the room is not published to the directory.	2024-01-22 13:59:45 +00:00

1 2 3 4 5 ...

16012 Commits