forked-synapse

mirror of https://mau.dev/maunium/synapse.git synced 2024-10-01 01:36:05 -04:00

Author	SHA1	Message	Date
Kegan Dougal	259442fa4c	bugfix: make msc3967 idempotent (#16943 ) MSC3967 was updated recently to make it more robust to network failures: > there is an existing cross-signing master key and it exactly matches the cross-signing master key provided in the request body. If there are any additional keys provided in the request (self signing key, user signing key) they MUST also match the existing keys stored on the server. In other words, the request contains no new keys. If there are new keys, UIA MUST be performed. https://github.com/matrix-org/matrix-spec-proposals/blob/hughns/device-signing-upload-uia/proposals/3967-device-signing-upload-uia.md#proposal This covers the case where the 200 OK is lost in transit so the client retries the upload, only to then get UIA'd. Complement tests: https://github.com/matrix-org/complement/pull/713 - passing example https://github.com/element-hq/synapse/actions/runs/7976948122/job/21778795094?pr=16943#step:7:8820 --------- Co-authored-by: reivilibre <oliverw@matrix.org>	2024-04-15 10:57:56 +00:00
Nick Mills-Barrett	fe4719a268	Use receipts `event_stream_ordering` instead of joins (#17032 ) Resurrecting https://github.com/matrix-org/synapse/pull/13918. This should reduce IOPs incurred by joining to the events table to lookup stream ordering, which happens in many receipt handling code paths. Like the previous PR I believe sufficient time has passed between the original migration in DB schema 72 and now to merge this as-is. It's highly unlikely that both the migration is still ongoing AND (active) users still have any receipts prior to that date. In the unlikely event there is a receipt without a populated `event_stream_ordering` synapse will behave just as it does now when receipts exist for events that don't (yet): for push action calculation the receipts are just ignored. I've removed the validation on event IDs as this is already covered here: `59ceabcb97/synapse/handlers/receipts.py (L189-L192)`	2024-04-12 09:28:44 +01:00
Erik Johnston	3a30846bd0	Fix mypy on latest Twisted release (#17036 ) `ITransport.abortConnection` isn't a thing, but `HTTPChannel.forceAbortClient` calls it, so let's just use that. Fixes https://github.com/element-hq/synapse/issues/16728	2024-04-11 16:03:45 +01:00
Patrick Cloke	657b8cc75c	Stabilize support for MSC4010: push rules & account data. (#17022 ) See [MSC4010](https://github.com/matrix-org/matrix-spec-proposals/pull/4010), but this is pretty much just removing an experimental flag. Part of #17021	2024-04-09 17:11:50 +01:00
Patrick Cloke	a2a543fd12	Stabilize support for MSC3981: recurse /relations (#17023 ) See [MSC3981](https://github.com/matrix-org/matrix-spec-proposals/pull/3981), this pretty much just removes flags though. Part of #17021	2024-04-09 17:11:08 +01:00
Erik Johnston	89f1092284	Also check if first event matches the last in prev batch (#17066 ) Refinement of #17064 cc @richvdh	2024-04-09 14:01:12 +00:00
Mathieu Velten	e363881592	Fix PR #16677 , a parameter was missing in a function call (#17033 ) Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com>	2024-04-09 14:06:46 +01:00
Erik Johnston	d40878451c	Add forgotten schema delta (#17054 ) This should have been in #17045. Whoops.	2024-04-09 13:03:41 +01:00
Erik Johnston	4d10a8fb18	Fixups to #17064 (#17065 ) Forgot a line, and an empty batch is trivially linear. c.f. #17064	2024-04-08 14:55:19 +01:00
Erik Johnston	1f8f991d51	Add back fast path for non-gappy syncs (#17064 ) PR #16942 removed an invalid optimisation that avoided pulling out state for non-gappy syncs. This causes a large increase in DB usage. c.f. #16941 for why that optimisation was wrong. However, we can still optimise in the simple case where the events in the timeline are a linear chain without any branching/merging of the DAG. cc. @richvdh	2024-04-08 14:25:28 +01:00
Erik Johnston	5360baeb64	Pull out fewer receipts from DB when doing push (#17049 ) Before we were pulling out all read receipts for a user for every event we pushed. Instead let's only pull out the relevant receipts. This also pulled out the event rows for each receipt, causing load on the events table.	2024-04-05 12:46:34 +01:00
Richard van der Hoff	0e68e9b7f4	Fix bug in calculating state for non-gappy syncs (#16942 ) Unfortunately, the optimisation we applied here for non-gappy syncs is not actually valid. Fixes https://github.com/element-hq/synapse/issues/16941. ~~Based on https://github.com/element-hq/synapse/pull/16930.~~ Requires https://github.com/matrix-org/sytest/pull/1374.	2024-04-04 16:15:35 +00:00
Richard van der Hoff	230b709d9d	`/sync`: fix bug in calculating `state` response (#16930 ) Fix a long-standing issue which could cause state to be omitted from the sync response if the last event was filtered out. Fixes: https://github.com/element-hq/synapse/issues/16928	2024-04-04 12:14:24 +00:00
Richard van der Hoff	05957ac70f	Fix bug in `/sync` response for archived rooms (#16932 ) This PR fixes a very, very niche edge-case, but I've got some more work coming which will otherwise make the problem worse. The bug happens when the syncing user leaves a room, and has a sync filter which includes "left" rooms, but sets the timeline limit to 0. In that case, the state returned in the `state` section is calculated incorrectly. The fix is to pass a token corresponding to the point that the user leaves the room through to `compute_state_delta`.	2024-04-04 12:47:59 +01:00
Erik Johnston	31122b71bc	Add missing index to `access_tokens` table (#17045 ) This was causing sequential scans when using refresh tokens.	2024-04-04 11:05:40 +01:00
Erik Johnston	ec174d0470	Refactor chain fetching (#17044 ) Since these queries are duplicated in two places.	2024-04-02 15:33:56 +01:00
Erik Johnston	fd48fc4585	Fixups to new push stream (#17038 ) Follow on from #17037	2024-03-28 16:29:23 +00:00
Erik Johnston	ea6bfae0fc	Add support for moving `/push_rules` off of main process (#17037 )	2024-03-28 15:44:07 +00:00
Erik Johnston	c900d18647	Fix OIDC login regression (#17031 ) Requests may require a User-Agent header, and the change in #16972 accidentally removed it, resulting in requests getting rejected causing login to fail.	2024-03-26 13:26:46 +00:00
Richard van der Hoff	b5322b4daf	Ensure that pending to-device events are sent over federation at startup (#16925 ) Fixes https://github.com/element-hq/synapse/issues/16680, as well as a related bug, where servers which we had never successfully sent an event to would not be retried. In order to fix the case of pending to-device messages, we hook into the existing `wake_destinations_needing_catchup` process, by extending it to look for destinations that have pending to-device messages. The federation transmission loop then attempts to send the pending to-device messages as normal.	2024-03-22 13:24:11 +00:00
Mathieu Velten	b7af076ab5	Add OIDC config to add extra parameters to the authorize URL (#16971 )	2024-03-22 10:35:11 +00:00
SpiritCroc	9ad49e7ecf	Do not refuse to set read_marker if previous event_id is in wrong room (#16990 )	2024-03-21 18:43:07 +00:00
Hanadi	f7a3ebe44d	Fix reject knocks on deactivating account (#17010 )	2024-03-21 18:05:54 +00:00
Mathieu Velten	3ab9e6d524	OIDC: try to JWT decode userinfo response if JSON parsing failed (#16972 )	2024-03-21 17:49:44 +00:00
Shay	cf5adc80e1	Update power level default for public rooms (#16907 )	2024-03-19 17:55:31 +00:00
Shay	8fb5b0f335	Improve event validation (#16908 ) As the title states.	2024-03-19 17:52:53 +00:00
Mathieu Velten	74ab329eaa	Pass module API to OIDC mapping provider (#16974 ) As done for SAML mapping provider, let's pass the module API to the OIDC one so the mapper can do more logic in its code.	2024-03-19 17:20:10 +00:00
Richard van der Hoff	9635822cc1	Clarify docs for some room state functions (#16950 ) State before an event is different to state after that event, and people tend to assume the wrong one.	2024-03-19 17:16:37 +00:00
Richard van der Hoff	52f456a822	`/sync`: Fix edge-case in calculating the "device_lists" response (#16949 ) Fixes https://github.com/element-hq/synapse/issues/16948. If the `join` and the `leave` are in the same sync response, we need to count them as a "left" user.	2024-03-14 17:34:19 +00:00
Richard van der Hoff	6d5bafb2c8	Split up `SyncHandler.compute_state_delta` (#16929 ) This is a huge method, which melts my brain. This is a non-functional change which lays some groundwork for future work in this area.	2024-03-14 17:18:48 +00:00
Mathieu Velten	cb562d73aa	Improve lock performance when a lot of locks are waiting (#16840 ) When a lot of locks are waiting for a single lock, notifying all locks independently with `call_later` on each release is really costly and incurs some kind of async contention, where the CPU is spinning a lot for not much. The included test is taking around 30s before the change, and 0.5s after. It was found following failing tests with https://github.com/element-hq/synapse/pull/16827.	2024-03-14 13:49:54 +00:00
dependabot[bot]	9b5eef95ad	Bump ruff from 0.1.14 to 0.3.2 (#16994 )	2024-03-13 17:06:23 +00:00
dependabot[bot]	e161103b46	Bump mypy from 1.5.1 to 1.8.0 (#16901 )	2024-03-13 17:05:57 +00:00
dependabot[bot]	1e68b56a62	Bump black from 23.10.1 to 24.2.0 (#16936 )	2024-03-13 16:46:44 +00:00
Gerrit Gogel	1f88790764	Prevent locking up while processing batched_auth_events (#16968 ) This PR aims to fix #16895, caused by a regression in #7 and not fixed by #16903. The PR #16903 only fixes a starvation issue, where the CPU isn't released. There is a second issue, where the execution is blocked. This theory is supported by the flame graphs provided in #16895 and the fact that I see the CPU usage reducing and far below the limit. Since the changes in #7, the method `check_state_independent_auth_rules` is called with the additional parameter `batched_auth_events`: `6fa13b4f92/synapse/handlers/federation_event.py (L1741-L1743)` It makes the execution enter this if clause, introduced with #15195 `6fa13b4f92/synapse/event_auth.py (L178-L189)` There are two issues in the above code snippet. First, there is the blocking issue. I'm not entirely sure if this is a deadlock, starvation, or something different. In the beginning, I thought the copy operation was responsible. It wasn't. Then I investigated the nested `store.get_events` inside the function `update`. This was also not causing the blocking issue. Only when I replaced the set difference operation (`-` ) with a list comprehension, the blocking was resolved. Creating and comparing sets with a very large amount of events seems to be problematic. This is how the flamegraph looks now while persisting outliers. As you can see, the execution no longer locks up in the above function. ![output_2024-02-28_13-59-40](https://github.com/element-hq/synapse/assets/13143850/6db9c9ac-484f-47d0-bdde-70abfbd773ec) Second, the copying here doesn't serve any purpose, because only a shallow copy is created. This means the same objects from the original dict are referenced. This fails the intention of protecting these objects from mutation. The review of the original PR https://github.com/matrix-org/synapse/pull/15195 had an extensive discussion about this matter. 
Various approaches to copying the auth_events were attempted: 1) Implementing a deepcopy caused issues due to builtins.EventInternalMetadata not being pickleable. 2) Creating a dict with new objects akin to a deepcopy. 3) Creating a dict with new objects containing only necessary attributes. Concluding, there is no easy way to create an actual copy of the objects. Opting for a deepcopy can significantly strain memory and CPU resources, making it an inefficient choice. I don't see why the copy is necessary in the first place. Therefore I'm proposing to remove it altogether. After these changes, I was able to successfully join these rooms, without the main worker locking up: - #synapse:matrix.org - #element-android:matrix.org - #element-web:matrix.org - #ecips:matrix.org - #ipfs-chatter:ipfs.io - #python:matrix.org - #matrix:matrix.org	2024-03-12 15:07:36 +00:00
Alexander Fechler	48f59d3806	deactivated flag refactored to filter deactivated users. (#16874 ) Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com>	2024-03-11 16:08:04 +00:00
Patrick Cloke	696cc9e802	Stabilize support for Retry-After header (MSC4014) (#16947 )	2024-03-08 09:33:46 +00:00
Quentin Gliech	4af33015af	Fix joining remote rooms when a `on_new_event` callback is registered (#16973 ) Since Synapse 1.76.0, any module which registers a `on_new_event` callback would brick the ability to join remote rooms. This is because this callback tried to get the full state of the room, which would end up in a deadlock. Related: https://github.com/matrix-org/synapse-auto-accept-invite/issues/18 The following module would brick the ability to join remote rooms:
```python
from typing import Any, Dict, Literal, Union
import logging

from synapse.module_api import ModuleApi, EventBase

logger = logging.getLogger(__name__)


class MyModule:
    def __init__(self, config: None, api: ModuleApi):
        self._api = api
        self._config = config

        self._api.register_third_party_rules_callbacks(
            on_new_event=self.on_new_event,
        )

    async def on_new_event(self, event: EventBase, _state_map: Any) -> None:
        logger.info(f"Received new event: {event}")

    @staticmethod
    def parse_config(_config: Dict[str, Any]) -> None:
        return None
```
This is technically a breaking change, as we are now passing partial state on the `on_new_event` callback. However, this callback was broken for federated rooms since 1.76.0, and local rooms have full state anyway, so it's unlikely that it would change anything.	2024-03-06 16:00:20 +01:00
Andrew Morgan	8a05304222	Revert "Improve DB performance of calculating badge counts for push. (#16756 )" (#16979 )	2024-03-05 12:27:27 +00:00
Erik Johnston	cdbbf3653d	Don't lock up when joining large rooms (#16903 ) Co-authored-by: Andrew Morgan <andrew@amorgan.xyz>	2024-02-20 14:29:18 +00:00
kegsay	c51a2240d1	bugfix: always prefer unthreaded receipt when >1 exist (MSC4102) (#16927 ) Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com>	2024-02-20 14:12:06 +00:00
Remi Rampin	0621e8eb0e	Add metric for emails sent (#16881 ) This adds a counter `synapse_emails_sent_total` for emails sent. They are broken down by `type`, which are `password_reset`, `registration`, `add_threepid`, `notification` (matching the methods of `Mailer`).	2024-02-14 15:30:03 +00:00
Erik Johnston	7b4d7429f8	Don't invalidate the entire event cache when we purge history (#16905 ) We do this by adding support to the LRU cache for "extra indices" based on the cached value. This allows us to efficiently map from room ID to the cached events and only invalidate those.	2024-02-13 13:24:11 +00:00
Erik Johnston	01910b981f	Add a config to not send out device list updates for specific users (#16909 ) List of users not to send out device list updates for when they register new devices. This is useful to handle bot accounts. This is undocumented as it's mostly a hack to test on matrix.org. Note: This will still send out device list updates if the device is later updated, e.g. end to end keys are added.	2024-02-13 13:23:03 +00:00
Erik Johnston	ea1b30940e	Merge remote-tracking branch 'origin/release-v1.101' into develop	2024-02-09 10:52:35 +00:00
Erik Johnston	bfa93d1d3b	Only do one concurrent fetch per server in keyring (#16894 ) Otherwise if we've stacked a bunch of requests for the keys of a server, we'll end up sending lots of concurrent requests for its keys, needlessly.	2024-02-09 10:51:11 +00:00
Erik Johnston	02a147039c	Increase batching when fetching auth chains (#16893 ) This basically reverts a change that was in https://github.com/element-hq/synapse/pull/16833, where we reduced the batching. The smaller batching can cause performance issues on busy servers and databases.	2024-02-09 10:51:00 +00:00
David Baker	71ca199165	Accept unprefixed form of MSC3981 recurse parameter (#16842 ) Now that the MSC3981 has passed FCP	2024-02-06 09:48:39 +00:00
dependabot[bot]	871f51c270	Bump lxml-stubs from 0.4.0 to 0.5.1 (#16885 )	2024-02-06 09:29:17 +00:00
Erik Johnston	adf15c4f6b	Run `ANALYZE` after fiddling with stats (#16849 ) Introduced in #16833 Fixes #16844	2024-01-24 13:57:12 +00:00
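
The fast path restored in #17064 (with the fixup in #17065) applies when the timeline events form a linear chain with no branching or merging of the DAG. The following is a minimal sketch of such a check, not Synapse's actual implementation: the `Event` class and the `is_linear_chain` helper are hypothetical names introduced only for illustration, and the extra comparison against the previous batch described in #17066 is not modelled.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass(frozen=True)
class Event:
    """Hypothetical stand-in for a timeline event (not Synapse's EventBase)."""

    event_id: str
    prev_event_ids: Sequence[str]


def is_linear_chain(timeline: List[Event]) -> bool:
    """Return True if every event's only predecessor is the event before it.

    An empty (or single-event) timeline has no adjacent pairs to compare,
    so it is treated as trivially linear.
    """
    for previous, current in zip(timeline, timeline[1:]):
        # A merge has several prev events; a fork points at an older event.
        if list(current.prev_event_ids) != [previous.event_id]:
            return False
    return True


a = Event("$a", [])
b = Event("$b", ["$a"])
c = Event("$c", ["$b"])
merge = Event("$m", ["$a", "$b"])  # two predecessors: the DAG merges here
fork = Event("$f", ["$a"])  # skips "$b": the DAG branched
```

Under these assumptions, `is_linear_chain([a, b, c])` holds, while a timeline ending in `merge` or `fork` does not qualify for the fast path.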

15916 Commits
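
The message for #16968 argues that a shallow copy of the auth events dict does not protect the event objects from mutation, because the copy references the same objects. A small standalone sketch of that behaviour follows; `AuthEvent` and its `rejected` attribute are hypothetical and are not taken from Synapse's code.

```python
import copy


class AuthEvent:
    """Hypothetical mutable event object, for illustration only."""

    def __init__(self, event_id: str) -> None:
        self.event_id = event_id
        self.rejected = False


key = ("m.room.create", "")
auth_events = {key: AuthEvent("$create")}

# A shallow copy creates a new dict, but its values are the same objects.
shallow = dict(auth_events)
# A deep copy creates new objects as well, at extra CPU and memory cost.
deep = copy.deepcopy(auth_events)

shallow_shares_objects = shallow[key] is auth_events[key]
deep_shares_objects = deep[key] is auth_events[key]

# Mutating through the shallow copy is visible through the original dict,
# while the deep copy keeps its own independent state.
shallow[key].rejected = True
```

This only illustrates the general Python semantics behind the argument. The commit message reports that a real deep copy was not practical there, since `EventInternalMetadata` could not be pickled.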