2018-11-15 16:03:12 +01:00
|
|
|
```
|
2019-03-01 19:24:37 +01:00
|
|
|
_ ___ _
|
|
|
|
| | |__ \ | |
|
|
|
|
_ __ ___ __ _| |_ ) | ___ __ _____| |__ Trashing your meta,
|
|
|
|
| '_ ` _ \ / _` | __| / / |___|\ \ /\ / / _ \ '_ \ keeping your data,
|
|
|
|
| | | | | | (_| | |_ / /_ \ V V / __/ |_) | within your browser.
|
|
|
|
|_| |_| |_|\__,_|\__|____| \_/\_/ \___|_.__/
|
|
|
|
```
|
2018-11-15 16:03:12 +01:00
|
|
|
|
|
|
|
This is an online version of [mat2](https://0xacab.org/jvoisin/mat2).
|
|
|
|
Keep in mind that this is a beta version, don't rely on it for anything
|
|
|
|
serious, yet.
|
|
|
|
|
2018-12-25 19:24:42 +01:00
|
|
|
# Demo instance
|
|
|
|
|
|
|
|
There is a demo instance deployed a [mat2-web.dustri.org](https://mat2-web.dustri.org).
|
2019-03-09 11:40:33 +01:00
|
|
|
Please don't upload any sensitive files to it.
|
2018-12-25 19:24:42 +01:00
|
|
|
|
2019-11-15 01:07:36 -08:00
|
|
|
# Vue Frontend
|
|
|
|
data:image/s3,"s3://crabby-images/8f03f/8f03f15f99b6e55dc0ccfdd35569d0becbc0ed21" alt="Frontend GIF Preview"
|
|
|
|
There is a SPA Frontend available at https://0xacab.org/jfriedli/mat2-quasar-frontend. It consumes
|
|
|
|
the RESTful API of this project. As a fallback for non JS users it redirects to this web app.
|
|
|
|
To set it up checkout the [Readme](https://0xacab.org/jfriedli/mat2-quasar-frontend/blob/master/README.md).
|
|
|
|
|
2018-11-15 16:03:12 +01:00
|
|
|
# How to deploy it?
|
|
|
|
|
2019-07-10 14:24:37 +00:00
|
|
|
mat2 is available in [Debian stable](https://packages.debian.org/stable/mat2).
|
2018-11-15 16:03:12 +01:00
|
|
|
|
|
|
|
```
|
2019-02-23 13:27:33 +01:00
|
|
|
# apt install uwsgi uwsgi-plugin-python3 git mat2
|
|
|
|
# apt install nginx-light # if you prefer nginx
|
|
|
|
# apt install apache2 libapache2-mod-proxy-uwsgi # if you prefer Apache2
|
2018-11-15 16:03:12 +01:00
|
|
|
# cd /var/www/
|
|
|
|
# git clone https://0xacab.org/jvoisin/mat2-web.git
|
|
|
|
# mkdir ./mat2-web/uploads/
|
|
|
|
# chown -R www-data:www-data ./mat2-web
|
|
|
|
```
|
|
|
|
|
2019-03-01 19:24:37 +01:00
|
|
|
Since [uWSGI](https://uwsgi-docs.readthedocs.io/en/latest/) isn't fun to
|
|
|
|
configure, feel free to copy
|
|
|
|
[this file](https://0xacab.org/jvoisin/mat2-web/tree/master/config/uwsgi.config)
|
|
|
|
to `/etc/uwsgi/apps-enabled/mat2-web.ini` and
|
|
|
|
[this one](https://0xacab.org/jvoisin/mat2-web/tree/master/config/nginx.config)
|
2019-08-28 08:33:28 -07:00
|
|
|
to `/etc/nginx/sites-enabled/mat2-web`.
|
2018-11-15 16:03:12 +01:00
|
|
|
|
2019-02-23 13:27:33 +01:00
|
|
|
Nginx is the recommended web engine, but you can also use Apache if you prefer,
|
|
|
|
by copying [this file](https://0xacab.org/jvoisin/mat2-web/tree/master/config/apache2.config)
|
|
|
|
to your `/etc/apache2/sites-enabled/mat2-web` file.
|
2018-12-22 14:20:06 +01:00
|
|
|
|
2020-03-27 10:59:16 -07:00
|
|
|
Configure the following environment variables:
|
|
|
|
|
2020-03-31 20:33:06 +02:00
|
|
|
- `MAT2_ALLOW_ORIGIN_WHITELIST=https://myhost1.org https://myhost2.org`
|
|
|
|
Note that you can add multiple hosts from which you want to accept API requests. These need to be separated by
|
|
|
|
a space. **IMPORTANT:** The default value if the variable is not set is: `Access-Control-Allow-Origin: *`
|
2020-03-27 10:59:16 -07:00
|
|
|
- `MAT2_MAX_FILES_BULK_DOWNLOAD=10` Max number of files that can be grouped for a bulk download.
|
2020-03-31 20:33:06 +02:00
|
|
|
Note: Each file has a max file size of 16mb
|
2019-09-21 05:58:05 -07:00
|
|
|
|
2020-03-31 20:33:06 +02:00
|
|
|
- `MAT2_MAX_FILE_AGE_FOR_REMOVAL=900` Seconds a file in the upload folder is kept.
|
|
|
|
After that it will be deleted. Default `15 * 60`
|
|
|
|
- `MAT2_WEB_DOWNLOAD_FOLDER` Define the upload folder path. Defaults to: `./uploads/`
|
|
|
|
|
2019-03-01 19:24:37 +01:00
|
|
|
Finally, restart uWSGI and your web server:
|
2018-12-24 13:50:18 +01:00
|
|
|
|
2018-12-22 14:20:06 +01:00
|
|
|
```
|
|
|
|
systemctl restart uwsgi
|
2018-12-24 13:50:18 +01:00
|
|
|
systemctl restart nginx/apache/…
|
2018-12-22 14:20:06 +01:00
|
|
|
```
|
|
|
|
|
2018-11-15 16:03:12 +01:00
|
|
|
It should now be working.
|
2018-12-16 21:54:25 +01:00
|
|
|
|
2020-06-26 17:05:08 +02:00
|
|
|
Note for reverse proxies: Include the Host header to ensure all generated urls are correct.
|
|
|
|
e.g. for Nginx: `proxy_set_header Host $host;`
|
|
|
|
|
2019-03-01 17:20:45 +01:00
|
|
|
# Deploy via Ansible
|
|
|
|
|
2019-03-01 19:24:37 +01:00
|
|
|
If you happen to be using [Ansible](https://www.ansible.com/), there's an
|
|
|
|
Ansible role to deploy mat2-web on Debian, thanks to the amazing
|
|
|
|
[systemli](https://www.systemli.org/en/index.html) people:
|
|
|
|
[ansible-role-mat2-web](https://github.com/systemli/ansible-role-mat2-web)
|
|
|
|
|
|
|
|
The role installs mat2-web as a uWSGI service, and runs it as a dedicated
|
2019-03-09 11:40:33 +01:00
|
|
|
system user, installs bubblewrap to sandbox mat2 and creates a garbage
|
|
|
|
collector cronjob to remove leftover files. Besides, it can create a
|
2019-03-01 19:24:37 +01:00
|
|
|
[dm-crypt](https://en.wikipedia.org/wiki/Dm-crypt) volume with random key for
|
|
|
|
the uploads folder, to ensure that the uploaded files won't be recoverable
|
|
|
|
between reboots.
|
2019-03-01 17:20:45 +01:00
|
|
|
|
2019-07-02 13:22:33 -07:00
|
|
|
# Development
|
|
|
|
Install docker and docker-compose and then run `docker-compose up` to setup
|
|
|
|
the docker dev environment. Mat2-web is now accessible on your host machine at `localhost:5000`.
|
|
|
|
Every code change triggers a restart of the app.
|
|
|
|
If you want to add/remove dependencies you have to rebuild the container.
|
|
|
|
|
2019-07-09 14:56:21 -07:00
|
|
|
# RESTful API
|
|
|
|
|
2020-07-13 09:12:30 +02:00
|
|
|
If you go to https://api.matweb.info/apidocs/ you can find a Swagger documentation.
|
2020-07-13 08:42:30 +02:00
|
|
|
|
2019-07-09 14:56:21 -07:00
|
|
|
## Upload Endpoint
|
|
|
|
|
2020-07-13 08:58:12 +02:00
|
|
|
**Endpoint:** `/api/upload`
|
2019-07-09 14:56:21 -07:00
|
|
|
|
|
|
|
**HTTP Verbs:** POST
|
|
|
|
|
|
|
|
**Body:**
|
|
|
|
```json
|
|
|
|
{
|
|
|
|
"file_name": "my-filename.jpg",
|
|
|
|
"file": "iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAADUlEQVR42mNk+M9QDwADhgGAWjR9awAAAABJRU5ErkJggg=="
|
|
|
|
}
|
|
|
|
```
|
|
|
|
|
|
|
|
The `file_name` parameter takes the file name.
|
|
|
|
The `file` parameter is the base64 encoded file which will be cleaned.
|
|
|
|
|
|
|
|
**Example Response:**
|
|
|
|
```json
|
|
|
|
{
|
2020-05-15 14:38:13 +02:00
|
|
|
"inactive_after_sec": 120,
|
2019-07-09 14:56:21 -07:00
|
|
|
"output_filename": "fancy.cleaned.jpg",
|
2019-08-12 07:28:08 -07:00
|
|
|
"mime": "image/jpg",
|
2019-07-09 14:56:21 -07:00
|
|
|
"key": "81a541f9ebc0233d419d25ed39908b16f82be26a783f32d56c381559e84e6161",
|
2020-04-27 12:58:12 -07:00
|
|
|
"secret": "44deb60b5febbd466e042f4172d36bcc5f7eb2eb6791d6e93191c378a381ae7c",
|
2019-07-09 14:56:21 -07:00
|
|
|
"meta": {
|
|
|
|
"BitDepth": 8,
|
|
|
|
"ColorType": "RGB with Alpha",
|
|
|
|
"Compression": "Deflate/Inflate",
|
|
|
|
"Filter": "Adaptive",
|
|
|
|
"Interlace": "Noninterlaced"
|
|
|
|
},
|
|
|
|
"meta_after": {},
|
2020-04-27 12:58:12 -07:00
|
|
|
"download_link": "http://localhost:5000/download/81a541f9ebc0233d419d25ed39908b16f82be26a783f32d56c381559e84e6161/44deb60b5febbd466e042f4172d36bcc5f7eb2eb6791d6e93191c378a381ae7c/fancy.cleaned.jpg"
|
2019-07-09 14:56:21 -07:00
|
|
|
}
|
|
|
|
```
|
|
|
|
|
|
|
|
## Supported Extensions Endpoint
|
|
|
|
|
|
|
|
**Endpoint:** `/api/extension`
|
|
|
|
|
|
|
|
**HTTP Verbs:** GET
|
|
|
|
|
|
|
|
**Example Response (shortened):**
|
|
|
|
```json
|
|
|
|
[
|
|
|
|
".asc",
|
|
|
|
".avi",
|
|
|
|
".bat",
|
|
|
|
".bmp",
|
|
|
|
".brf",
|
|
|
|
".c",
|
|
|
|
".css",
|
|
|
|
".docx",
|
|
|
|
".epub"
|
|
|
|
]
|
|
|
|
```
|
|
|
|
|
2019-09-21 05:58:05 -07:00
|
|
|
**Endpoint:** `/api/download/bulk`
|
|
|
|
|
|
|
|
This endpoint allows you to bulk download several files
|
|
|
|
which you uploaded beforehand. Note that the `download_list`
|
|
|
|
MUST contain more than two files. The max length is configurable
|
|
|
|
(default is 10).
|
|
|
|
|
|
|
|
**HTTP Verbs:** POST
|
|
|
|
|
|
|
|
**Body:**
|
|
|
|
```json
|
|
|
|
{
|
|
|
|
"download_list": [
|
|
|
|
{
|
|
|
|
"file_name": "uploaded_file_name.jpg",
|
2020-04-27 12:58:12 -07:00
|
|
|
"key": "uploaded_file_key",
|
2020-05-08 17:42:03 +02:00
|
|
|
"secret": "uploaded_file_secret"
|
2019-09-21 05:58:05 -07:00
|
|
|
}
|
|
|
|
]
|
|
|
|
}
|
|
|
|
```
|
|
|
|
|
|
|
|
The `file_name` parameter takes the file name from a previously uploaded file.
|
|
|
|
The `key` parameter is the key from a previously uploaded file.
|
|
|
|
|
|
|
|
**Example Response:**
|
|
|
|
```json
|
|
|
|
{
|
2020-05-15 14:38:13 +02:00
|
|
|
"inactive_after_sec": 120,
|
2019-09-21 05:58:05 -07:00
|
|
|
"output_filename": "files.2cd225d5-2d75-44a2-9f26-e120a87e4279.cleaned.zip",
|
|
|
|
"mime": "application/zip",
|
|
|
|
"key": "5ee4cf8821226340d3d5ed16bd2e1b435234a9ad218f282b489a85d116e7a4c4",
|
2020-04-27 12:58:12 -07:00
|
|
|
"secret": "44deb60b5febbd466e042f4172d36bcc5f7eb2eb6791d6e93191c378a381ae7c",
|
2019-09-21 05:58:05 -07:00
|
|
|
"meta_after": {},
|
|
|
|
"download_link": "http://localhost/api/download/5ee4cf8821226340d3d5ed16bd2e1b435234a9ad218f282b489a85d116e7a4c4/files.2cd225d5-2d75-44a2-9f26-e120a87e4279.cleaned.zip"
|
|
|
|
}
|
|
|
|
```
|
|
|
|
|
2019-08-28 08:33:28 -07:00
|
|
|
# Docker
|
2019-10-24 14:08:48 +02:00
|
|
|
|
2019-08-28 08:33:28 -07:00
|
|
|
There are two Dockerfiles present in this repository. The file called `Dockerfile.development` is used for development
|
|
|
|
and `Dockerfile.production` is used for production deployments.
|
|
|
|
|
|
|
|
You can find the automated docker builds in the registry of this
|
|
|
|
repository: https://0xacab.org/jvoisin/mat2-web/container_registry
|
|
|
|
|
2020-03-28 15:17:45 +01:00
|
|
|
### Building the production image
|
|
|
|
Build command: `docker build -f Dockerfile.production -t mat-web .`
|
|
|
|
|
2020-04-12 16:08:52 +02:00
|
|
|
Run it: `docker run -ti -p8181:8080 --read-only --tmpfs /tmp --tmpfs /run/uwsgi --tmpfs=/app/upload --security-opt=no-new-privileges --security-opt=seccomp=./config/seccomp.json mat-web:latest`
|
2020-03-28 15:17:45 +01:00
|
|
|
|
2020-03-28 15:22:31 +01:00
|
|
|
This does mount the upload folder as tmpfs and servers the app on `localhost:8181`.
|
2020-03-28 15:17:45 +01:00
|
|
|
|
2020-04-02 23:19:50 +02:00
|
|
|
##### Podman
|
|
|
|
Build: `podman build -f Dockerfile.production -t matweb-podman .`
|
|
|
|
|
2020-04-12 15:52:29 +02:00
|
|
|
Run: `podman run -ti -p8181:8080 --read-only --tmpfs /tmp --tmpfs /run/uwsgi --tmpfs=/app/upload --security-opt=no-new-privileges --security-opt=seccomp=./config/seccomp.json matweb-podman:latest`
|
2020-04-02 23:19:50 +02:00
|
|
|
|
2019-10-24 14:08:48 +02:00
|
|
|
# Configuration
|
|
|
|
|
|
|
|
The default settings from `main.py` may be overridden by adding a `config.py`
|
|
|
|
file and add custom values for the relevant flask config variables. E.g.:
|
|
|
|
|
|
|
|
```
|
|
|
|
MAX_CONTENT_LENGTH = 32 * 1024 * 1024 # 32MB
|
|
|
|
```
|
|
|
|
|
|
|
|
See [Flask configuration docs](http://exploreflask.com/en/latest/configuration.html)
|
|
|
|
for further information.
|
|
|
|
|
2019-02-26 12:40:11 +01:00
|
|
|
# Custom templates
|
|
|
|
|
|
|
|
You can override the default templates from `templates/` by putting replacements
|
|
|
|
into the directory path that's configured in `app.config['CUSTOM_TEMPLATES_DIR']`
|
|
|
|
(default `custom_templates/`).
|
|
|
|
|
2019-02-22 21:20:51 +01:00
|
|
|
# Threat model
|
|
|
|
|
|
|
|
- An attacker in possession of the very same file that a user wants to clean,
|
|
|
|
along with its names, can perform a denial of service by continually
|
|
|
|
requesting this file, and getting it before the user.
|
|
|
|
- An attacker in possession of only the name of a file that a user wants to
|
|
|
|
clean can't perform a denial of service attack, since the path to download
|
2019-03-04 22:37:14 +00:00
|
|
|
the cleaned file is not only dependent of the name, but also the content.
|
2019-02-22 21:20:51 +01:00
|
|
|
- The server should do its very best to delete files as soon as possible.
|
|
|
|
|
2018-12-16 21:54:25 +01:00
|
|
|
# Licenses
|
|
|
|
|
|
|
|
- mat2-web is under MIT
|
|
|
|
- The [raleway](https://github.com/impallari/Raleway/) font is under OFL1.1
|
|
|
|
- [normalize.css](https://github.com/necolas/normalize.css/) is under MIT
|
|
|
|
- [skeleton](http://getskeleton.com/) is under MIT
|