forked-synapse

mirror of https://mau.dev/maunium/synapse.git synced 2024-10-01 01:36:05 -04:00

Author	SHA1	Message	Date
Erik Johnston	f721f1baba	Revert "Make all `process_replication_rows` methods async (#13304 )" (#13312 ) This reverts commit `5d4028f217`.	2022-07-18 14:28:14 +01:00
Nick Mills-Barrett	5d4028f217	Make all `process_replication_rows` methods async (#13304 ) More prep work for asyncronous caching, also makes all process_replication_rows methods consistent (presence handler already is so). Signed off by Nick @ Beeper (@Fizzadar)	2022-07-17 22:19:43 +01:00
Sean Quah	1391a76cd2	Faster room joins: fix race in recalculation of current room state (#13151 ) Bounce recalculation of current state to the correct event persister and move recalculation of current state into the event persistence queue, to avoid concurrent updates to a room's current state. Also give recalculation of a room's current state a real stream ordering. Signed-off-by: Sean Quah <seanq@matrix.org>	2022-07-07 12:19:31 +00:00
Sean Quah	68db233f0c	Handle race between persisting an event and un-partial stating a room (#13100 ) Whenever we want to persist an event, we first compute an event context, which includes the state at the event and a flag indicating whether the state is partial. After a lot of processing, we finally try to store the event in the database, which can fail for partial state events when the containing room has been un-partial stated in the meantime. We detect the race as a foreign key constraint failure in the data store layer and turn it into a special `PartialStateConflictError` exception, which makes its way up to the method in which we computed the event context. To make things difficult, the exception needs to cross a replication request: `/fed_send_events` for events coming over federation and `/send_event` for events from clients. We transport the `PartialStateConflictError` as a `409 Conflict` over replication and turn `409`s back into `PartialStateConflictError`s on the worker making the request. All client events go through `EventCreationHandler.handle_new_client_event`, which is called in a lot of places. Instead of trying to update all the code which creates client events, we turn the `PartialStateConflictError` into a `429 Too Many Requests` in `EventCreationHandler.handle_new_client_event` and hope that clients take it as a hint to retry their request. On the federation event side, there are 7 places which compute event contexts. 4 of them use outlier event contexts: `FederationEventHandler._auth_and_persist_outliers_inner`, `FederationHandler.do_knock`, `FederationHandler.on_invite_request` and `FederationHandler.do_remotely_reject_invite`. These events won't have the partial state flag, so we do not need to do anything for then. The remaining 3 paths which create events are `FederationEventHandler.process_remote_join`, `FederationEventHandler.on_send_membership_event` and `FederationEventHandler._process_received_pdu`. We can't experience the race in `process_remote_join`, unless we're handling an additional join into a partial state room, which currently blocks, so we make no attempt to handle it correctly. `on_send_membership_event` is only called by `FederationServer._on_send_membership_event`, so we catch the `PartialStateConflictError` there and retry just once. `_process_received_pdu` is called by `on_receive_pdu` for incoming events and `_process_pulled_event` for backfill. The latter should never try to persist partial state events, so we ignore it. We catch the `PartialStateConflictError` in `on_receive_pdu` and retry just once. Refering to the graph of code paths in https://github.com/matrix-org/synapse/issues/12988#issuecomment-1156857648 may make the above make more sense. Signed-off-by: Sean Quah <seanq@matrix.org>	2022-07-05 16:12:52 +01:00
David Robertson	97e9fbe1b2	Type annotations in `synapse.databases.main.devices` (#13025 ) Co-authored-by: Patrick Cloke <clokep@users.noreply.github.com>	2022-06-15 15:20:04 +00:00
Patrick Cloke	cf05258f76	Remove groups replication code. (#12900 ) The replication logic for groups is no longer used, so the message passing infrastructure can be removed.	2022-05-31 13:04:08 -04:00
Erik Johnston	1e453053cb	Rename storage classes (#12913 )	2022-05-31 12:17:50 +00:00
reivilibre	39dee30f01	Send `USER_IP` commands on a different Redis channel, in order to reduce traffic to workers that do not process these commands. (#12809 )	2022-05-20 15:28:23 +01:00
reivilibre	177b884ad7	Lay some foundation work to allow workers to only subscribe to some kinds of messages, reducing replication traffic. (#12672 )	2022-05-19 16:29:08 +01:00
Andrew Morgan	83be72d76c	Add `StreamKeyType` class and replace string literals with constants (#12567 )	2022-05-16 15:35:31 +00:00
Sean Quah	a559c8b0d9	Respect the `@cancellable` flag for `ReplicationEndpoint`s (#12700 ) While `ReplicationEndpoint`s register themselves via `JsonResource`, they pass a method that calls the handler, instead of the handler itself, to `register_paths`. As a result, `JsonResource` will not correctly pick up the `@cancellable` flag and we have to apply it ourselves. Signed-off-by: Sean Quah <seanq@element.io>	2022-05-11 12:25:39 +01:00
Shay	d80a7ab151	Update `replication.md` with info on TCP module structure (#12621 )	2022-05-09 14:46:43 -07:00
Šimon Brandner	ef86cf3d28	Update `_on_new_receipts()` to work with MSC2285 changes. (#12636 )	2022-05-05 13:25:51 +00:00
Erik Johnston	c0379d6e5b	Reduce log spam when running multiple event persisters (#12610 )	2022-05-05 10:20:23 +01:00
Erik Johnston	d1cd96ce29	Add opentracing spans to calls to external cache (#12380 )	2022-04-07 13:18:29 +01:00
Sean Quah	800ba87cc8	Refactor and convert `Linearizer` to async (#12357 ) Refactor and convert `Linearizer` to async. This makes a `Linearizer` cancellation bug easier to fix. Also refactor to use an async context manager, which eliminates an unlikely footgun where code that doesn't immediately use the context manager could forget to release the lock. Signed-off-by: Sean Quah <seanq@element.io>	2022-04-05 15:43:52 +01:00
Erik Johnston	66053b6bfb	Prefill more stream change caches. (#12372 )	2022-04-05 14:26:41 +01:00
Erik Johnston	b446c99ac9	Prefill the device_list_stream_cache (#12367 ) * Prefill the device_list_stream_cache * Newsfile * Newsfile	2022-04-04 20:12:25 +01:00
Erik Johnston	5c9e39e619	Track device list updates per room. (#12321 ) This is a first step in dealing with #7721. The idea is basically that rather than calculating the full set of users a device list update needs to be sent to up front, we instead simply record the rooms the user was in at the time of the change. This will allow a few things: 1. we can defer calculating the set of remote servers that need to be poked about the change; and 2. during `/sync` and `/keys/changes` we can avoid also avoid calculating users who share rooms with other users, and instead just look at the rooms that have changed. However, care needs to be taken to correctly handle server downgrades. As such this PR writes to both `device_lists_changes_in_room` and the `device_lists_outbound_pokes` table synchronously. In a future release we can then bump the database schema compat version to `69` and then we can assume that the new `device_lists_changes_in_room` exists and is handled. There is a temporary option to disable writing to `device_lists_outbound_pokes` synchronously, allowing us to test the new code path does work (and by implication upgrading to a future release and downgrading to this one will work correctly). Note: Ideally we'd do the calculation of room to servers on a worker (e.g. the background worker), but currently only master can write to the `device_list_outbound_pokes` table.	2022-04-04 15:25:20 +01:00
reivilibre	f871222880	Move `update_client_ip` background job from the main process to the background worker. (#12251 )	2022-04-01 13:08:55 +01:00
David Robertson	a2b00a4486	Bump `black` and `click` versions (#12320 )	2022-03-29 10:41:19 +00:00
reivilibre	4a53f35737	Improve code documentation for the typing stream over replication. (#12211 )	2022-03-11 14:00:15 +00:00
Patrick Cloke	3e4af36bc8	Rename get_tcp_replication to get_replication_command_handler. (#12192 ) Since the object it returns is a ReplicationCommandHandler. This is clean-up from adding support to Redis where the command handler was added as an additional layer of abstraction from the TCP protocol.	2022-03-10 13:01:56 +00:00
Nick Mills-Barrett	180d8ff0d4	Retry some http replication failures (#12182 ) This allows for the target process to be down for around a minute which provides time for restarts during synapse upgrades/config updates. Closes: #12178 Signed off by Nick Mills-Barrett nick@beeper.com	2022-03-09 14:53:28 +00:00
Patrick Cloke	d8bab6793c	Fix incorrect type hints for txredis. (#12042 ) Some properties were marked as RedisProtocol instead of ConnectionHandler, which wraps RedisProtocol instance(s).	2022-03-08 07:26:05 -05:00
Erik Johnston	423cca9efe	Spread out sending device lists to remote hosts (#12132 )	2022-03-04 11:48:15 +00:00
Richard van der Hoff	e24ff8ebe3	Remove `HomeServer.get_datastore()` (#12031 ) The presence of this method was confusing, and mostly present for backwards compatibility. Let's get rid of it. Part of #11733	2022-02-23 11:04:02 +00:00
Erik Johnston	6d14b3dabf	Better error message when failing to request from another process (#12060 )	2022-02-22 15:52:08 +00:00
Patrick Cloke	d0e78af35e	Add missing type hints to synapse.replication. (#11938 )	2022-02-08 11:03:08 -05:00
Patrick Cloke	6c0984e3f0	Remove unnecessary ignores due to Twisted upgrade. (#11939 ) Twisted 22.1.0 fixed some internal type hints, allowing Synapse to remove ignore calls for parameters to connectTCP.	2022-02-08 09:15:59 -05:00
Patrick Cloke	63d90f10ec	Add missing type hints to synapse.replication.http. (#11856 )	2022-02-08 07:44:39 -05:00
Richard van der Hoff	2277275485	Stop reading from `event_reference_hashes` (#11794 ) Preparation for dropping this table altogether. Part of #6574.	2022-01-21 09:18:10 +00:00
Patrick Cloke	10a88ba91c	Use auto_attribs/native type hints for attrs classes. (#11692 )	2022-01-13 13:49:28 +00:00
Richard van der Hoff	2359ee3864	Remove redundant `get_current_events_token` (#11643 ) * Push `get_room_{min,max_stream_ordering}` into StreamStore Both implementations of this are identical, so we may as well push it down and get rid of the abstract base class nonsense. * Remove redundant `StreamStore` class This is empty now * Remove redundant `get_current_events_token` This was an exact duplicate of `get_room_max_stream_ordering`, so let's get rid of it. * newsfile	2022-01-04 16:10:27 +00:00
Patrick Cloke	cbd82d0b2d	Convert all namedtuples to attrs. (#11665 ) To improve type hints throughout the code.	2021-12-30 18:47:12 +00:00
Sean Quah	5305a5e881	Type hint the constructors of the data store classes (#11555 )	2021-12-13 17:05:00 +00:00
Quentin Gliech	a15a893df8	Save the OIDC session ID (sid) with the device on login (#11482 ) As a step towards allowing back-channel logout for OIDC.	2021-12-06 12:43:06 -05:00
Sean Quah	ffd858aa68	Add type hints to `synapse/storage/databases/main/events_worker.py` (#11411 ) Also refactor the stream ID trackers/generators a bit and try to document them better.	2021-11-26 18:41:31 +00:00
Patrick Cloke	5cace20bf1	Add missing type hints to `synapse.app`. (#11287 )	2021-11-10 15:06:54 -05:00
Nick Barrett	af54167516	Enable passing typing stream writers as a list. (#11237 ) This makes the typing stream writer config match the other stream writers that only currently support a single worker.	2021-11-03 14:25:47 +00:00
Brendan Abolivier	c7a5e49664	Implement an `on_new_event` callback (#11126 ) Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com>	2021-10-26 15:17:36 +02:00
Sean Quah	2b82ec425f	Add type hints for most `HomeServer` parameters (#11095 )	2021-10-22 18:15:41 +01:00
Sean Quah	6a67f3786a	Fix logging context warnings when losing replication connection (#10984 ) Instead of triggering `__exit__` manually on the replication handler's logging context, use it as a context manager so that there is an `__enter__` call to balance the `__exit__`.	2021-10-15 13:10:58 +01:00
Sean Quah	6b18eb4430	Fix opentracing and Prometheus metrics for replication requests (#10996 ) This commit fixes two bugs to do with decorators not instrumenting `ReplicationEndpoint`'s `send_request` correctly. There are two decorators on `send_request`: Prometheus' `Gauge.track_inprogress()` and Synapse's `opentracing.trace`. `Gauge.track_inprogress()` does not have any support for async functions when used as a decorator. Since async functions behave like regular functions that return coroutines, only the creation of the coroutine was covered by the metric and none of the actual body of `send_request`. `Gauge.track_inprogress()` returns a regular, non-async function wrapping `send_request`, which is the source of the next bug. The `opentracing.trace` decorator would normally handle async functions correctly, but since the wrapped `send_request` is a non-async function, the decorator ends up suffering from the same issue as `Gauge.track_inprogress()`: the opentracing span only measures the creation of the coroutine and none of the actual function body. Using `Gauge.track_inprogress()` as a context manager instead of a decorator resolves both bugs.	2021-10-12 11:23:46 +01:00
David Robertson	51a5da74cc	Annotate synapse.storage.util (#10892 ) Also mark `synapse.streams` as having has no untyped defs Co-authored-by: Sean Quah <8349537+squahtx@users.noreply.github.com>	2021-10-08 14:25:16 +00:00
Patrick Cloke	f4b1a9a527	Require direct references to configuration variables. (#10985 ) This removes the magic allowing accessing configurable variables directly from the config object. It is now required that a specific configuration class is used (e.g. `config.foo` must be replaced with `config.server.foo`).	2021-10-06 10:47:41 -04:00
David Robertson	29364145b2	Pass str to twisted's IReactorTCP (#10895 ) This follows a correction made in twisted/twisted#1664 and should fix our Twisted Trial CI job. Until that change is in a twisted release, we'll have to ignore the type of the `host` argument. I've raised #10899 to remind us to review the issue in a few months' time.	2021-09-30 12:51:47 +01:00
Patrick Cloke	94b620a5ed	Use direct references for configuration variables (part 6). (#10916 )	2021-09-29 06:44:15 -04:00
Patrick Cloke	bb7fdd821b	Use direct references for configuration variables (part 5). (#10897 )	2021-09-24 07:25:21 -04:00
Patrick Cloke	01c88a09cd	Use direct references for some configuration variables (#10798 ) Instead of proxying through the magic getter of the RootConfig object. This should be more performant (and is more explicit).	2021-09-13 13:07:12 -04:00
Richard van der Hoff	1800aabfc2	Split `FederationHandler` in half (#10692 ) The idea here is to take anything to do with incoming events and move it out to a separate handler, as a way of making FederationHandler smaller.	2021-08-26 21:41:44 +01:00
Andrew Morgan	84469bdac7	Remove the unused public_room_list_stream (#10565 ) Co-authored-by: Patrick Cloke <clokep@users.noreply.github.com>	2021-08-17 14:02:50 +01:00
Richard van der Hoff	d9cb658c78	Fix up type hints for Twisted 21.7 (#10490 ) Mostly this involves decorating a few Deferred declarations with extra type hints. We wrap the types in quotes to avoid runtime errors when running against older versions of Twisted that don't have generics on Deferred.	2021-07-28 12:04:11 +00:00
Šimon Brandner	c3b037795a	Support for MSC2285 (hidden read receipts) (#10413 ) Implementation of matrix-org/matrix-doc#2285	2021-07-28 10:05:11 +02:00
Jonathan de Jong	bf72d10dbf	Use inline type hints in various other places (in `synapse/`) (#10380 )	2021-07-15 11:02:43 +01:00
Quentin Gliech	bd4919fb72	MSC2918 Refresh tokens implementation (#9450 ) This implements refresh tokens, as defined by MSC2918 This MSC has been implemented client side in Hydrogen Web: vector-im/hydrogen-web#235 The basics of the MSC works: requesting refresh tokens on login, having the access tokens expire, and using the refresh token to get a new one. Signed-off-by: Quentin Gliech <quentingliech@gmail.com>	2021-06-24 14:33:20 +01:00
Marcus	8070b893db	update black to 21.6b0 (#10197 ) Reformat all files with the new version. Signed-off-by: Marcus Hoffmann <bubu@bubu1.eu>	2021-06-17 15:20:06 +01:00
Richard van der Hoff	d7808a2dde	Extend `ResponseCache` to pass a context object into the callback (#10157 ) This is the first of two PRs which seek to address #8518. This first PR lays the groundwork by extending ResponseCache; a second PR (#10158) will update the SyncHandler to actually use it, and fix the bug. The idea here is that we allow the callback given to ResponseCache.wrap to decide whether its result should be cached or not. We do that by (optionally) passing a ResponseCacheContext into it, which it can modify.	2021-06-14 10:26:09 +01:00
Sorunome	d936371b69	Implement knock feature (#6739 ) This PR aims to implement the knock feature as proposed in https://github.com/matrix-org/matrix-doc/pull/2403 Signed-off-by: Sorunome mail@sorunome.de Signed-off-by: Andrew Morgan andrewm@element.io	2021-06-09 19:39:51 +01:00
Richard van der Hoff	1bf83a191b	Clean up the interface for injecting opentracing over HTTP (#10143 ) * Remove unused helper functions * Clean up the interface for injecting opentracing over HTTP * changelog	2021-06-09 11:33:00 +01:00
Richard van der Hoff	224f2f949b	Combine `LruCache.invalidate` and `invalidate_many` (#9973 ) * Make `invalidate` and `invalidate_many` do the same thing ... so that we can do either over the invalidation replication stream, and also because they always confused me a bit. * Kill off `invalidate_many` * changelog	2021-05-27 10:33:56 +01:00
Richard van der Hoff	c0df6bae06	Remove `keylen` from `LruCache`. (#9993 ) `keylen` seems to be a thing that is frequently incorrectly set, and we don't really need it. The only time it was used was to figure out if we had removed a subtree in `del_multi`, which we can do better by changing `TreeCache.pop` to return a different type (`TreeCacheNode`). Commits should be independently reviewable.	2021-05-24 14:02:01 +01:00
Erik Johnston	3e831f24ff	Don't hammer the database for destination retry timings every ~5mins (#10036 )	2021-05-21 17:57:08 +01:00
Andrew Morgan	4d6e5a5e99	Use a database table to hold the users that should have full presence sent to them, instead of something in-memory (#9823 )	2021-05-18 14:13:45 +01:00
Richard van der Hoff	b378d98c8f	Add debug logging for issue #9533 (#9959 ) Hopefully this will help us track down where to-device messages are getting lost/delayed.	2021-05-11 11:04:03 +01:00
Erik Johnston	e3bc4617fc	Time external cache response time (#9904 )	2021-05-04 15:14:22 +01:00
Erik Johnston	9d25a0ae65	Split presence out of master (#9820 )	2021-04-23 12:21:55 +01:00
Richard van der Hoff	294c675033	Remove `synapse.types.Collection` (#9856 ) This is no longer required, since we have dropped support for Python 3.5.	2021-04-22 16:43:50 +01:00
Andrew Morgan	4b2217ace2	Merge branch 'master' into develop	2021-04-21 14:55:06 +01:00
Richard van der Hoff	5d281c10dd	Stop BackgroundProcessLoggingContext making new prometheus timeseries (#9854 ) This undoes part of `b076bc276e`.	2021-04-21 10:03:31 +01:00
Andrew Morgan	6982db9651	Merge branch 'master' into develop	2021-04-20 14:55:16 +01:00
Patrick Cloke	b076bc276e	Always use the name as the log ID. (#9829 ) As far as I can tell our logging contexts are meant to log the request ID, or sometimes the request ID followed by a suffix (this is generally stored in the name field of LoggingContext). There's also code to log the name@memory location, but I'm not sure this is ever used. This simplifies the code paths to require every logging context to have a name and use that in logging. For sub-contexts (created via nested_logging_contexts, defer_to_threadpool, Measure) we use the current context's str (which becomes their name or the string "sentinel") and then potentially modify that (e.g. add a suffix).	2021-04-20 14:19:00 +01:00
Erik Johnston	de0d088adc	Add presence federation stream (#9819 )	2021-04-20 14:11:24 +01:00
Erik Johnston	00a6db9676	Move some replication processing out of generic_worker (#9796 ) Co-authored-by: Richard van der Hoff <1389908+richvdh@users.noreply.github.com>	2021-04-14 17:06:06 +01:00
Jonathan de Jong	4b965c862d	Remove redundant "coding: utf-8" lines (#9786 ) Part of #9744 Removes all redundant `# -- coding: utf-8 --` lines from files, as python 3 automatically reads source code as utf-8 now. `Signed-off-by: Jonathan de Jong <jonathan@automatia.nl>`	2021-04-14 15:34:27 +01:00
Patrick Cloke	48d44ab142	Record more information into structured logs. (#9654 ) Records additional request information into the structured logs, e.g. the requester, IP address, etc.	2021-04-08 08:01:14 -04:00
Jonathan de Jong	e2b8a90897	Update mypy configuration: `no_implicit_optional = True` (#9742 )	2021-04-05 09:10:18 -04:00
Erik Johnston	963f4309fe	Make RateLimiter class check for ratelimit overrides (#9711 ) This should fix a class of bug where we forget to check if e.g. the appservice shouldn't be ratelimited. We also check the `ratelimit_override` table to check if the user has ratelimiting disabled. That table is really only meant to override the event sender ratelimiting, so we don't use any values from it (as they might not make sense for different rate limits), but we do infer that if ratelimiting is disabled for the user we should disabled all ratelimits. Fixes #9663	2021-03-30 12:06:09 +01:00
Patrick Cloke	da75d2ea1f	Add type hints for the federation sender. (#9681 ) Includes an abstract base class which both the FederationSender and the FederationRemoteSendQueue must implement.	2021-03-29 11:43:20 -04:00
Erik Johnston	b5efcb577e	Make it possible to use dmypy (#9692 ) Running `dmypy run` will do a `mypy` check while spinning up a daemon that makes rerunning `dmypy run` a lot faster. `dmypy` doesn't support `follow_imports = silent` and has `local_partial_types` enabled, so this PR enables those options and fixes the issues that were newly raised. Note that `local_partial_types` will be enabled by default in upcoming mypy releases.	2021-03-26 16:49:46 +00:00
Patrick Cloke	b7748d3c00	Import HomeServer from the proper module. (#9665 )	2021-03-23 07:12:48 -04:00
Patrick Cloke	cc324d53fe	Fix up types for the typing handler. (#9638 ) By splitting this to two separate methods the callers know what methods they can expect on the handler.	2021-03-17 11:30:21 -04:00
Richard van der Hoff	567f88f835	Prep work for removing `outlier` from `internal_metadata` (#9411 ) * Populate `internal_metadata.outlier` based on `events` table Rather than relying on `outlier` being in the `internal_metadata` column, populate it based on the `events.outlier` column. * Move `outlier` out of InternalMetadata._dict Ultimately, this will allow us to stop writing it to the database. For now, we have to grandfather it back in so as to maintain compatibility with older versions of Synapse.	2021-03-17 12:33:18 +00:00
Patrick Cloke	d29b71aa50	Fix remaining mypy issues due to Twisted upgrade. (#9608 )	2021-03-15 11:14:39 -04:00
Patrick Cloke	55da8df078	Fix additional type hints from Twisted 21.2.0. (#9591 )	2021-03-12 11:37:57 -05:00
Richard van der Hoff	464e5da7b2	Add logging for redis connection setup (#9590 )	2021-03-11 18:35:09 +00:00
Richard van der Hoff	1107214a1d	Fix the auth provider on the logins metric (#9573 ) We either need to pass the auth provider over the replication api, or make sure we report the auth provider on the worker that received the request. I've gone with the latter.	2021-03-10 18:15:03 +00:00
Jonathan de Jong	d6196efafc	Add ResponseCache tests. (#9458 )	2021-03-08 14:00:07 -05:00
Patrick Cloke	58114f8a17	Create a SynapseReactor type which incorporates the necessary reactor interfaces. (#9528 ) This helps fix some type hints when running with Twisted 21.2.0.	2021-03-08 08:25:43 -05:00
Patrick Cloke	33a02f0f52	Fix additional type hints from Twisted upgrade. (#9518 )	2021-03-03 15:47:38 -05:00
Patrick Cloke	0c330423bc	Bump the mypy and mypy-zope versions. (#9529 )	2021-03-03 07:19:19 -05:00
Patrick Cloke	a0bc9d387e	Use the proper Request in type hints. (#9515 ) This also pins the Twisted version in the mypy job for CI until proper type hints are fixed throughout Synapse.	2021-03-01 12:23:46 -05:00
Erik Johnston	66f4949e7f	Fix deleting pushers when using sharded pushers. (#9465 )	2021-02-22 21:14:42 +00:00
AndrewFerr	9bc74743d5	Add configs to make profile data more private (#9203 ) Add off-by-default configuration settings to: - disable putting an invitee's profile info in invite events - disable profile lookup via federation Signed-off-by: Andrew Ferrazzutti <fair@miscworks.net>	2021-02-19 09:50:41 +00:00
Eric Eastwood	0a00b7ff14	Update black, and run auto formatting over the codebase (#9381 ) - Update black version to the latest - Run black auto formatting over the codebase - Run autoformatting according to [`docs/code_style.md `](`80d6dc9783/docs/code_style.md`) - Update `code_style.md` docs around installing black to use the correct version	2021-02-16 22:32:34 +00:00
Erik Johnston	6aa87f8ce3	Ensure that we never stop reconnecting to redis (#9391 )	2021-02-11 16:06:29 +00:00
Erik Johnston	dd8da8c5f6	Precompute joined hosts and store in Redis (#9198 )	2021-01-26 13:57:31 +00:00
Erik Johnston	a1ff1e967f	Periodically send pings to detect dead Redis connections (#9218 ) This is done by creating a custom `RedisFactory` subclass that periodically pings all connections in its pool. We also ensure that the `replyTimeout` param is non-null, so that we timeout waiting for the reply to those pings (and thus triggering a reconnect).	2021-01-26 10:54:54 +00:00
Erik Johnston	6633a4015a	Allow moving account data and receipts streams off master (#9104 )	2021-01-18 15:47:59 +00:00
Erik Johnston	f08ef64926	Enforce all replication HTTP clients calls use kwargs (#9144 )	2021-01-18 15:24:04 +00:00
Erik Johnston	b530eaa262	Allow running sendToDevice on workers (#9044 )	2021-01-07 20:19:26 +00:00
Erik Johnston	63593134a1	Some cleanups to device inbox store. (#9041 )	2021-01-07 17:20:44 +00:00
Erik Johnston	a7a913918c	Merge remote-tracking branch 'origin/erikj/as_mau_block' into develop	2020-12-18 09:51:56 +00:00
Erik Johnston	4c33796b20	Correctly handle AS registerations and add test	2020-12-17 12:55:21 +00:00
Patrick Cloke	bd30cfe86a	Convert internal pusher dicts to attrs classes. (#8940 ) This improves type hinting and should use less memory.	2020-12-16 11:25:30 -05:00
Patrick Cloke	1619802228	Various clean-ups to the logging context code (#8935 )	2020-12-14 14:19:47 -05:00
Patrick Cloke	96358cb424	Add authentication to replication endpoints. (#8853 ) Authentication is done by checking a shared secret provided in the Synapse configuration file.	2020-12-04 10:56:28 -05:00
Andrew Morgan	5cbe8d93fe	Add typing to membership Replication class methods (#8809 ) This PR grew out of #6739, and adds typing to some method arguments You'll notice that there are a lot of `# type: ignores` in here. This is due to the base methods not matching the overloads here. This is necessary to stop mypy complaining, but a better solution is #8828.	2020-11-27 10:49:38 +00:00
Andrew Morgan	e8d0853739	Generalise _maybe_store_room_on_invite (#8754 ) There's a handy function called maybe_store_room_on_invite which allows us to create an entry in the rooms table for a room and its version for which we aren't joined to yet, but we can reference when ingesting events about. This is currently used for invites where we receive some stripped state about the room and pass it down via /sync to the client, without us being in the room yet. There is a similar requirement for knocking, where we will eventually do the same thing, and need an entry in the rooms table as well. Thus, reusing this function works, however its name needs to be generalised a bit. Separated out from #6739.	2020-11-13 16:24:04 +00:00
Erik Johnston	f21e24ffc2	Add ability for access tokens to belong to one user but grant access to another user. (#8616 ) We do it this way round so that only the "owner" can delete the access token (i.e. `/logout/all` by the "owner" also deletes that token, but `/logout/all` by the "target user" doesn't). A future PR will add an API for creating such a token. When the target user and authenticated entity are different the `Processed request` log line will be logged with a: `{@admin:server as @bob:server} ...`. I'm not convinced by that format (especially since it adds spaces in there, making it harder to use `cut -d ' '` to chop off the start of log lines). Suggestions welcome.	2020-10-29 15:58:44 +00:00
Erik Johnston	a6ea1a957e	Don't pull event from DB when handling replication traffic. (#8669 ) I was trying to make it so that we didn't have to start a background task when handling RDATA, but that is a bigger job (due to all the code in `generic_worker`). However I still think not pulling the event from the DB may help reduce some DB usage due to replication, even if most workers will simply go and pull that event from the DB later anyway. Co-authored-by: Patrick Cloke <clokep@users.noreply.github.com>	2020-10-28 12:11:45 +00:00
Erik Johnston	4215a3acd4	Don't unnecessarily start bg process in replication sending loop. (#8670 )	2020-10-27 17:37:08 +00:00
Erik Johnston	2b7c180879	Start fewer opentracing spans (#8640 ) #8567 started a span for every background process. This is good as it means all Synapse code that gets run should be in a span (unless in the sentinel logging context), but it means we generate about 15x the number of spans as we did previously. This PR attempts to reduce that number by a) not starting one for send commands to Redis, and b) deferring starting background processes until after we're sure they're necessary. I don't really know how much this will help.	2020-10-26 09:30:19 +00:00
Richard van der Hoff	97647b33c2	Replace DeferredCache with LruCache where possible (#8563 ) Most of these uses don't need a full-blown DeferredCache; LruCache is lighter and more appropriate.	2020-10-19 12:20:29 +01:00
Richard van der Hoff	4182bb812f	move DeferredCache into its own module	2020-10-14 23:38:14 +01:00
Richard van der Hoff	9f87da0a84	Rename Cache->DeferredCache	2020-10-14 23:38:14 +01:00
Richard van der Hoff	7eff59ec91	Add some more type annotations to Cache	2020-10-14 23:38:14 +01:00
Erik Johnston	b2486f6656	Fix message duplication if something goes wrong after persisting the event (#8476 ) Should fix #3365.	2020-10-13 12:07:56 +01:00
Erik Johnston	8de3703d21	Make event persisters periodically announce position over replication. (#8499 ) Currently background proccesses stream the events stream use the "minimum persisted position" (i.e. `get_current_token()`) rather than the vector clock style tokens. This is broadly fine as it doesn't matter if the background processes lag a small amount. However, in extreme cases (i.e. SyTests) where we only write to one event persister the background processes will never make progress. This PR changes it so that the `MultiWriterIDGenerator` keeps the current position of a given instance as up to date as possible (i.e using the latest token it sees if its not in the process of persisting anything), and then periodically announces that over replication. This then allows the "minimum persisted position" to advance, albeit with a small lag.	2020-10-12 15:51:41 +01:00
Patrick Cloke	1781bbe319	Add type hints to response cache. (#8507 )	2020-10-09 11:35:11 -04:00
Erik Johnston	5009ffcaa4	Only send RDATA for instance local events. (#8496 ) When pulling events out of the DB to send over replication we were not filtering by instance name, and so we were sending events for other instances.	2020-10-09 13:10:33 +01:00
Patrick Cloke	c9c0ad5e20	Remove the deprecated Handlers object (#8494 ) All handlers now available via get_*_handler() methods on the HomeServer.	2020-10-09 07:24:34 -04:00
Erik Johnston	6c5d5e507e	Add unit test for event persister sharding (#8433 )	2020-10-02 09:57:12 +01:00
Patrick Cloke	4ff0201e62	Enable mypy checking for unreachable code and fix instances. (#8432 )	2020-10-01 08:09:18 -04:00
Erik Johnston	ea70f1c362	Various clean ups to room stream tokens. (#8423 )	2020-09-29 21:48:33 +01:00
Richard van der Hoff	866c84da8d	Add metrics to track success/otherwise of replication requests (#8406 ) One hope is that this might provide some insights into #3365.	2020-09-29 11:06:11 +01:00
Erik Johnston	f112cfe5bb	Fix MultiWriteIdGenerator's handling of restarts. (#8374 ) On startup `MultiWriteIdGenerator` fetches the maximum stream ID for each instance from the table and uses that as its initial "current position" for each writer. This is problematic as a) it involves either a scan of events table or an index (neither of which is ideal), and b) if rows are being persisted out of order elsewhere while the process restarts then using the maximum stream ID is not correct. This could theoretically lead to race conditions where e.g. events that are persisted out of order are not sent down sync streams. We fix this by creating a new table that tracks the current positions of each writer to the stream, and update it each time we finish persisting a new entry. This is a relatively small overhead when persisting events. However for the cache invalidation stream this is a much bigger relative overhead, so instead we note that for invalidation we don't actually care about reliability over restarts (as there's no caches to invalidate) and simply don't bother reading and writing to the new table in that particular case.	2020-09-24 16:53:51 +01:00
Erik Johnston	ac11fcbbb8	Add EventStreamPosition type (#8388 ) The idea is to remove some of the places we pass around `int`, where it can represent one of two things: 1. the position of an event in the stream; or 2. a token that partitions the stream, used as part of the stream tokens. The valid operations are then: 1. did a position happen before or after a token; 2. get all events that happened before or after a token; and 3. get all events between two tokens. (Note that we don't want to allow other operations as we want to change the tokens to be vector clocks rather than simple ints)	2020-09-24 13:24:17 +01:00
Patrick Cloke	8a4a4186de	Simplify super() calls to Python 3 syntax. (#8344 ) This converts calls like super(Foo, self) -> super(). Generated with: sed -i "" -Ee 's/super\([^\(]+\)/super()/g' */.py	2020-09-18 09:56:44 -04:00
Jonathan de Jong	a3f124b821	Switch metaclass initialization to python 3-compatible syntax (#8326 )	2020-09-16 15:15:55 -04:00
Patrick Cloke	aec294ee0d	Use slots in attrs classes where possible (#8296 ) slots use less memory (and attribute access is faster) while slightly limiting the flexibility of the class attributes. This focuses on objects which are instantiated "often" and for short periods of time.	2020-09-14 12:50:06 -04:00
Patrick Cloke	d2a3eb04a4	Fix typos in comments.	2020-09-14 11:46:58 -04:00
Erik Johnston	04cc249b43	Add experimental support for sharding event persister. Again. (#8294 ) This is not ready for production yet. Caveats: 1. We should write some tests... 2. The stream token that we use for events can get stalled at the minimum position of all writers. This means that new events may not be processed and e.g. sent down sync streams if a writer isn't writing or is slow.	2020-09-14 10:16:41 +01:00
Erik Johnston	5d3e306d9f	Clean up `Notifier.on_new_room_event` code path (#8288 ) The idea here is that we pass the `max_stream_id` to everything, and only use the stream ID of the particular event to figure out when the max stream position has caught up to the event and we can notify people about it. This is to maintain the distinction between the position of an item in the stream (i.e. event A has stream ID 513) and a token that can be used to partition the stream (i.e. give me all events after stream ID 352). This distinction becomes important when the tokens are more complicated than a single number, which they will be once we start tracking the position of multiple writers in the tokens. The valid operations here are: 1. Is a position before or after a token 2. Fetching all events between two tokens 3. Merging multiple tokens to get the "max", i.e. `C = max(A, B)` means that for all positions P where P is before A or before B, then P is before C. Future PR will change the token type to a dedicated type.	2020-09-10 13:24:43 +01:00
Patrick Cloke	2ea1c68249	Remove some unused distributor signals (#8216 ) Removes the `user_joined_room` and stops calling it since there are no observers. Also cleans-up some other unused signals and related code.	2020-09-09 12:22:00 -04:00
Erik Johnston	c9dbee50ae	Fixup pusher pool notifications (#8287 ) `pusher_pool.on_new_notifications` expected a min and max stream ID, however that was not what we were passing in. Instead, let's just pass it the current max stream ID and have it track the last stream ID it got passed. I believe that it mostly worked as we called the function for every event. However, it would break for events that got persisted out of order, i.e, that were persisted but the max stream ID wasn't incremented as not all preceding events had finished persisting, and push for that event would be delayed until another event got pushed to the effected users.	2020-09-09 16:56:08 +01:00
Erik Johnston	dc9dcdbd59	Revert "Fixup pusher pool notifications" This reverts commit `e7fd336a53`.	2020-09-09 16:19:22 +01:00
Erik Johnston	e7fd336a53	Fixup pusher pool notifications	2020-09-09 16:17:50 +01:00
Patrick Cloke	c619253db8	Stop sub-classing object (#8249 )	2020-09-04 06:54:56 -04:00
Brendan Abolivier	9f8abdcc38	Revert "Add experimental support for sharding event persister. (#8170 )" (#8242 ) * Revert "Add experimental support for sharding event persister. (#8170)" This reverts commit `82c1ee1c22`. * Changelog	2020-09-04 10:19:42 +01:00
Erik Johnston	82c1ee1c22	Add experimental support for sharding event persister. (#8170 ) This is not ready for production yet. Caveats: 1. We should write some tests... 2. The stream token that we use for events can get stalled at the minimum position of all writers. This means that new events may not be processed and e.g. sent down sync streams if a writer isn't writing or is slow.	2020-09-02 15:48:37 +01:00
Richard van der Hoff	aa07c37cf0	Move and rename `get_devices_with_keys_by_user` (#8204 ) * Move `get_devices_with_keys_by_user` to `EndToEndKeyWorkerStore` this seems a better fit for it. This commit simply moves the existing code: no other changes at all. * Rename `get_devices_with_keys_by_user` to better reflect what it does. * get_device_stream_token abstract method To avoid referencing fields which are declared in the derived classes, make `get_device_stream_token` abstract, and define that in the classes which define `_device_list_id_gen`.	2020-09-01 12:41:21 +01:00
Erik Johnston	3b4556cf87	Fix `wait_for_stream_position` for multiple waiters. (#8196 ) This fixes a bug where having multiple callers waiting on the same stream and position will cause it to try and compare two deferreds, which fails (due to the sorted list having an entry of `Tuple[int, Deferred]`).	2020-08-28 17:12:45 +01:00
Erik Johnston	e3c91a3c55	Make SlavedIdTracker.advance have same interface as MultiWriterIDGenerator (#8171 )	2020-08-26 13:15:20 +01:00
Erik Johnston	c9c544cda5	Remove `ChainedIdGenerator`. (#8123 ) It's just a thin wrapper around two ID gens to make `get_current_token` and `get_next` return tuples. This can easily be replaced by calling the appropriate methods on the underlying ID gens directly.	2020-08-19 13:41:51 +01:00
Patrick Cloke	eebf52be06	Be stricter about JSON that is accepted by Synapse (#8106 )	2020-08-19 07:26:03 -04:00
Erik Johnston	76d21d14a0	Separate `get_current_token` into two. (#8113 ) The function is used for two purposes: 1) for subscribers of streams to get a token they can use to get further updates with, and 2) for replication to track position of the writers of the stream. For streams with a single writer the two scenarios produce the same result, however the situation becomes complicated for streams with multiple writers. The current `MultiWriterIdGenerator` does not correctly handle the first case (which is not an issue as its only used for the `caches` stream which nothing subscribes to outside of replication).	2020-08-19 10:39:31 +01:00
Patrick Cloke	ac77cdb64e	Add a shadow-banned flag to users. (#8092 )	2020-08-14 12:37:59 -04:00
David Vo	4dd27e6d11	Reduce unnecessary whitespace in JSON. (#7372 )	2020-08-07 08:02:55 -04:00
Patrick Cloke	d4a7829b12	Convert synapse.api to async/await (#8031 )	2020-08-06 08:30:06 -04:00

1 2 3 4 5 ...

711 Commits