synapse-product

mirror of https://git.anonymousland.org/anonymousland/synapse-product.git synced 2024-12-18 06:04:19 -05:00

Author	SHA1	Message	Date
Erik Johnston	437a99fb99	Fix user_daily_visits to not have duplicate rows for UA. (#8654 ) * Fix user_daily_visits to not have duplicate rows for UA. Fixes #8641. * Newsfile * Fix typo. Co-authored-by: Patrick Cloke <clokep@users.noreply.github.com>	2020-10-26 13:16:32 +00:00
Patrick Cloke	de5cafe980	Add type hints to profile and base handlers. (#8609 )	2020-10-21 06:44:31 -04:00
Patrick Cloke	9e0f22874f	Consistently use wrap_as_background_task in more places (#8599 )	2020-10-20 11:29:38 -04:00
Will Hunt	626b8f0846	Move schema file for as_device_stream (#8590 ) * Move schema file * Add a . * Add matching changelog entry * Fix sqlite	2020-10-20 10:18:55 +01:00
Vasilis Gerakaris	34c20493b9	Drop unused `device_max_stream_id` table (#8589 ) Signed-off-by: Vasilis Gerakaris <vasilis.gerakaris@navarino.gr>	2020-10-19 19:06:54 +01:00
Richard van der Hoff	903d11c43a	Add `DeferredCache.get_immediate` method (#8568 ) * Add `DeferredCache.get_immediate` method A bunch of things that are currently calling `DeferredCache.get` are only really interested in the result if it's completed. We can optimise and simplify this case. * Remove unused 'default' parameter to DeferredCache.get() * another get_immediate instance	2020-10-19 15:00:12 +01:00
Richard van der Hoff	97647b33c2	Replace DeferredCache with LruCache where possible (#8563 ) Most of these uses don't need a full-blown DeferredCache; LruCache is lighter and more appropriate.	2020-10-19 12:20:29 +01:00
Jonathan de Jong	79c1f973ce	Pre-emptively fix synapse.storage.types.Connection for future mypy release (#8577 ) Fix the Connection protocol according to typeshed's assertions about sqlite3.Connection	2020-10-17 09:51:38 +01:00
Patrick Cloke	1b70662be9	Clean-up old transaction IDs on the background worker. (#8544 )	2020-10-16 12:06:17 -04:00
Will Hunt	c276bd9969	Send some ephemeral events to appservices (#8437 ) Optionally sends typing, presence, and read receipt information to appservices.	2020-10-15 12:33:28 -04:00
Richard van der Hoff	0a08cd1065	Merge pull request #8548 from matrix-org/rav/deferred_cache Rename Cache to DeferredCache, and related changes	2020-10-15 11:42:07 +01:00
Neil Johnson	1f39155071	Include user agent in user daily visits table (#8503 ) Include user agent in user daily visits table.	2020-10-15 10:36:40 +01:00
Richard van der Hoff	4182bb812f	move DeferredCache into its own module	2020-10-14 23:38:14 +01:00
Richard van der Hoff	9f87da0a84	Rename Cache->DeferredCache	2020-10-14 23:38:14 +01:00
Erik Johnston	19b15d63e8	Use autocommit mode for single statement DB functions. (#8542 ) Autocommit means that we don't wrap the functions in transactions, and they instead get executed directly. Introduced in #8456. This will help: 1. reduce the number of `could not serialize access due to concurrent delete` errors that we see (though there are a few functions that often cause serialization errors that we don't fix here); 2. improve the DB performance, as it no longer needs to deal with the overhead of `REPEATABLE READ` isolation levels; and 3. improve wall clock speed of these functions, as we no longer need to send `BEGIN` and `COMMIT` to the DB. Some notes about the differences between autocommit mode and our default `REPEATABLE READ` transactions: 1. Currently `autocommit` only applies when using PostgreSQL, and is ignored when using SQLite (due to silliness with [Twisted DB classes](https://twistedmatrix.com/trac/ticket/9998)). 2. Autocommit functions may get retried on error, which means they can get applied twice (or more) to the DB (since they are not in a transaction the previous call would not get rolled back). This means that the functions need to be idempotent (or otherwise not care about being called multiple times). Read queries, simple deletes, and updates/upserts that replace rows (rather than generating new values from existing rows) are all idempotent. 3. Autocommit functions no longer get executed in [`REPEATABLE READ`](https://www.postgresql.org/docs/current/transaction-iso.html) isolation level, and so data can change between queries, which is fine for single statement queries.	2020-10-14 15:50:59 +01:00
Erik Johnston	618d405a32	Remove racy assertion in MultiWriterIDGenerator (#8530 ) We asserted that the IDs returned by the postgres sequence were greater than any we had seen, however this is technically racy as we may update the current positions out of order. We now assert that the sequences are correct on startup, so the assertion is no longer really required, so we remove it.	2020-10-14 15:40:06 +01:00
Brendan Abolivier	3ee97a2748	Make sure a retention policy is a state event (#8527 ) * Make sure a retention policy is a state event * Changelog	2020-10-14 12:00:52 +01:00
Patrick Cloke	629a951b49	Move additional tasks to the background worker, part 4 (#8513 )	2020-10-13 08:20:32 -04:00
Erik Johnston	b2486f6656	Fix message duplication if something goes wrong after persisting the event (#8476 ) Should fix #3365.	2020-10-13 12:07:56 +01:00
Erik Johnston	8de3703d21	Make event persisters periodically announce position over replication. (#8499 ) Currently background processes that stream the events stream use the "minimum persisted position" (i.e. `get_current_token()`) rather than the vector clock style tokens. This is broadly fine as it doesn't matter if the background processes lag a small amount. However, in extreme cases (i.e. SyTests) where we only write to one event persister the background processes will never make progress. This PR changes it so that the `MultiWriterIDGenerator` keeps the current position of a given instance as up to date as possible (i.e. using the latest token it sees if it's not in the process of persisting anything), and then periodically announces that over replication. This then allows the "minimum persisted position" to advance, albeit with a small lag.	2020-10-12 15:51:41 +01:00
Erik Johnston	5009ffcaa4	Only send RDATA for instance local events. (#8496 ) When pulling events out of the DB to send over replication we were not filtering by instance name, and so we were sending events for other instances.	2020-10-09 13:10:33 +01:00
Patrick Cloke	fe0f4a3591	Move additional tasks to the background worker, part 3 (#8489 )	2020-10-09 07:37:51 -04:00
Patrick Cloke	a93f3121f8	Add type hints to some handlers (#8505 )	2020-10-09 07:20:51 -04:00
Hubert Chathi	a97cec18bb	Invalidate the cache when an olm fallback key is uploaded (#8501 )	2020-10-08 13:24:46 -04:00
Patrick Cloke	e4f72ddc44	Move additional tasks to the background worker (#8458 )	2020-10-07 11:27:56 -04:00
Erik Johnston	ae5b2a72c0	Reduce serialization errors in MultiWriterIdGen (#8456 ) We call `_update_stream_positions_table_txn` a lot, which is an UPSERT that can conflict in `REPEATABLE READ` isolation level. Instead of doing a transaction consisting of a single query we may as well run it outside of a transaction.	2020-10-07 15:15:57 +01:00
Erik Johnston	52a50e8686	Use vector clocks for room stream tokens. (#8439 ) Currently when using multiple event persisters we (in the worst case) don't tell clients about events until all event persisters have persisted new events after the original event. This is suboptimal, especially if one of the event persisters goes down. To handle this, we encode the position of each event persister in the room tokens so that we can send events to clients immediately. To reduce the size of the token we do two things: 1. We create a unique immutable persistent mapping between instance names and a generated small integer ID, which we can encode in the tokens instead of the instance name; and 2. We encode the "persisted upto position" of the room token and then only explicitly include instances that have positions strictly greater than that. The new tokens look something like: `m3478~1.3488~2.3489`, where the first number is the min position, and the subsequent `~` separated pairs are the instance ID to positions map. (We use `.` and `~` as separators as they're URL safe and not already used by `StreamToken`).	2020-10-07 15:15:33 +01:00
Patrick Cloke	b460a088c6	Add typing information to the device handler. (#8407 )	2020-10-07 08:58:21 -04:00
Hubert Chathi	4cb44a1585	Add support for MSC2697: Dehydrated devices (#8380 ) This allows a user to store an offline device on the server and then restore it at a subsequent login.	2020-10-07 08:00:17 -04:00
Hubert Chathi	3cd78bbe9e	Add support for MSC2732: olm fallback keys (#8312 )	2020-10-06 13:26:29 -04:00
Richard van der Hoff	f31f8e6319	Remove stream ordering from Metadata dict (#8452 ) There's no need for it to be in the dict as well as the events table. Instead, we store it in a separate attribute in the EventInternalMetadata object, and populate that on load. This means that we can rely on it being correctly populated for any event which has been persisted to the database.	2020-10-05 14:43:14 +01:00
Patrick Cloke	c5251c6fbd	Do not assume that account data is of the correct form. (#8454 ) This fixes a bug where `m.ignored_user_list` was assumed to be a dict, leading to odd behavior for users who set it to something else.	2020-10-05 09:28:05 -04:00
Erik Johnston	e3debf9682	Add logging on startup/shutdown (#8448 ) This is so we can tell what is going on when things are taking a while to start up. The main change here is to ensure that transactions that are created during startup get correctly logged like normal transactions.	2020-10-02 15:20:45 +01:00
Erik Johnston	ec10bdd32b	Speed up unit tests when using PostgreSQL (#8450 )	2020-10-02 15:09:31 +01:00
Patrick Cloke	62894673e6	Allow background tasks to be run on a separate worker. (#8369 )	2020-10-02 08:23:15 -04:00
Richard van der Hoff	462e681c79	Merge tag 'v1.21.0rc2' into develop Synapse 1.21.0rc2 (2020-10-02) ============================== Features -------- - Convert additional templates from inline HTML to Jinja2 templates. ([\#8444](https://github.com/matrix-org/synapse/issues/8444)) Bugfixes -------- - Fix a regression in v1.21.0rc1 which broke thumbnails of remote media. ([\#8438](https://github.com/matrix-org/synapse/issues/8438)) - Do not expose the experimental `uk.half-shot.msc2778.login.application_service` flow in the login API, which caused a compatibility problem with Element iOS. ([\#8440](https://github.com/matrix-org/synapse/issues/8440)) - Fix malformed log line in new federation "catch up" logic. ([\#8442](https://github.com/matrix-org/synapse/issues/8442)) - Fix DB query on startup for negative streams which caused long start up times. Introduced in [\#8374](https://github.com/matrix-org/synapse/issues/8374). ([\#8447](https://github.com/matrix-org/synapse/issues/8447))	2020-10-02 12:59:17 +01:00
Erik Johnston	695240d34a	Fix DB query on startup for negative streams. (#8447 ) For negative streams we have to negate the internal stream ID before querying the DB. The effect of this bug was to query far too many rows, slowing start up time, but we would correctly filter the results afterwards so there was no ill effect.	2020-10-02 12:22:19 +01:00
Patrick Cloke	4ff0201e62	Enable mypy checking for unreachable code and fix instances. (#8432 )	2020-10-01 08:09:18 -04:00
Erik Johnston	7941372ec8	Make token serializing/deserializing async (#8427 ) The idea is that in future tokens will encode a mapping of instance to position. However, we don't want to include the full instance name in the string representation, so instead we'll have a mapping between instance name and an immutable integer ID in the DB that we can use instead. We'll then do the lookup when we serialize/deserialize the token (we could alternatively pass around an `Instance` type that includes both the name and ID, but that turns out to be a lot more invasive).	2020-09-30 20:29:19 +01:00
Richard van der Hoff	20e7c4de26	Add an improved "forward extremities" metric Hopefully, N(extremities) * N(state_events) is a more realistic approximation to "how big a problem is this room?".	2020-09-30 16:49:15 +01:00
Richard van der Hoff	6d2d42f8fb	Rewrite BucketCollector This was a bit unwieldy for what I wanted: in particular, I wanted to assign each measurement straight into a bucket, rather than storing an intermediate Counter which didn't do any bucketing at all. I've replaced it with something that is hopefully a bit easier to use. (I'm not entirely sure what the difference between a HistogramMetricFamily and a GaugeHistogramMetricFamily is, but given our counters can go down as well as up the latter sounds more accurate?)	2020-09-30 16:49:15 +01:00
Erik Johnston	ea70f1c362	Various clean ups to room stream tokens. (#8423 )	2020-09-29 21:48:33 +01:00
Erik Johnston	b1433bf231	Don't table scan events on worker startup (#8419 ) * Fix table scan of events on worker startup. This happened because we assumed "new" writers had an initial stream position of 0, so the replication code tried to fetch all events written by the instance between 0 and the current position. Instead, set the initial position of new writers to the current persisted up to position, on the assumption that new writers won't have written anything before that point. * Consider old writers coming back as "new". Otherwise we'd try and fetch entries between the old stale token and the current position, even though it won't have written any rows. Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com>	2020-09-29 16:42:19 +01:00
Richard van der Hoff	2649d545a5	Mypy fixes for `synapse.handlers.federation` (#8422 ) For some reason, an apparently unrelated PR upset mypy about this module. Here are a number of little fixes.	2020-09-29 15:57:36 +01:00
Will Hunt	8676d8ab2e	Filter out appservices from mau count (#8404 ) This is an attempt to fix #8403.	2020-09-29 13:11:02 +01:00
Erik Johnston	bd380d942f	Add checks for postgres sequence consistency (#8402 )	2020-09-28 18:00:30 +01:00
Matthew Hodgson	4b3a1faa08	typo	2020-09-28 00:23:35 +01:00
Tdxdxoz	abd04b6af0	Allow existing users to login via OpenID Connect. (#8345 ) Co-authored-by: Benjamin Koch <bbbsnowball@gmail.com> This adds configuration flags that will match a user to pre-existing users when logging in via OpenID Connect. This is useful when switching to an existing SSO system.	2020-09-25 07:01:45 -04:00
Erik Johnston	3e87d79e1c	Fix schema delta for servers that have not backfilled (#8396 ) Fixes #8395.	2020-09-25 09:58:32 +01:00
Erik Johnston	f112cfe5bb	Fix MultiWriterIdGenerator's handling of restarts. (#8374 ) On startup `MultiWriterIdGenerator` fetches the maximum stream ID for each instance from the table and uses that as its initial "current position" for each writer. This is problematic as a) it involves either a scan of the events table or an index (neither of which is ideal), and b) if rows are being persisted out of order elsewhere while the process restarts then using the maximum stream ID is not correct. This could theoretically lead to race conditions where e.g. events that are persisted out of order are not sent down sync streams. We fix this by creating a new table that tracks the current positions of each writer to the stream, and update it each time we finish persisting a new entry. This is a relatively small overhead when persisting events. However for the cache invalidation stream this is a much bigger relative overhead, so instead we note that for invalidation we don't actually care about reliability over restarts (as there are no caches to invalidate) and simply don't bother reading and writing to the new table in that particular case.	2020-09-24 16:53:51 +01:00
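
The room stream token format described in commit 52a50e8686 above can be illustrated with a small sketch. This is not Synapse's implementation: the function names `parse_room_token` and `format_room_token` are hypothetical, and the layout is inferred only from the sample token `m3478~1.3488~2.3489` and the rule that instances are listed only when their position is strictly greater than the min position.

```python
from typing import Dict, Tuple


def parse_room_token(token: str) -> Tuple[int, Dict[int, int]]:
    """Split a token like `m3478~1.3488~2.3489` into (min_pos, {instance_id: pos}).

    Hypothetical helper; layout assumed from the example in the commit message.
    """
    if not token.startswith("m"):
        raise ValueError(f"unexpected token prefix: {token!r}")
    # The first `~`-separated field is the min position; the rest are pairs.
    head, *pairs = token[1:].split("~")
    instance_map: Dict[int, int] = {}
    for pair in pairs:
        instance_id, pos = pair.split(".")
        instance_map[int(instance_id)] = int(pos)
    return int(head), instance_map


def format_room_token(min_pos: int, positions: Dict[int, int]) -> str:
    """Encode positions, keeping only instances strictly ahead of min_pos."""
    parts = [f"m{min_pos}"]
    for instance_id, pos in sorted(positions.items()):
        if pos > min_pos:
            parts.append(f"{instance_id}.{pos}")
    return "~".join(parts)
```

An instance sitting exactly at the min position is omitted when formatting, which is what keeps the token short when most writers are caught up.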


3897 Commits
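
The idempotency requirement noted in commit 19b15d63e8 (autocommit functions may be retried, so they must be safe to apply more than once) can be shown with a minimal sketch. It uses Python's standard `sqlite3` module purely as a stand-in database; the table `positions` and the helper `run_with_retry` are hypothetical and are not part of Synapse, where autocommit currently applies only to PostgreSQL.

```python
import sqlite3


def run_with_retry(conn: sqlite3.Connection, sql: str, args: tuple, attempts: int = 2) -> None:
    """Simulate a retried autocommit statement by executing it `attempts` times."""
    for _ in range(attempts):
        conn.execute(sql, args)


# isolation_level=None puts the sqlite3 connection in autocommit mode.
conn = sqlite3.connect(":memory:", isolation_level=None)
conn.execute("CREATE TABLE positions (name TEXT PRIMARY KEY, pos INTEGER)")
conn.execute("INSERT INTO positions VALUES ('a', 10), ('b', 10)")

# Replacing a value is idempotent: running it twice leaves the same row.
run_with_retry(conn, "UPDATE positions SET pos = ? WHERE name = 'a'", (11,))

# Deriving a new value from the existing row is not: each retry adds again.
run_with_retry(conn, "UPDATE positions SET pos = pos + ? WHERE name = 'b'", (1,))

result = dict(conn.execute("SELECT name, pos FROM positions"))
```

After both statements have been applied twice, row `a` holds the intended value while row `b` has been incremented once too often, which is why only the first style of statement is suitable for autocommit with retries.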