add all communication notes

This commit is contained in:
steinkirch.eth, phd 2023-07-15 16:41:11 -07:00
parent 9aa9e827c0
commit c8d550c140
2 changed files with 161 additions and 71 deletions

@ -61,27 +61,44 @@
<br>
---
### scripts and snippets
<br>
#### services and pubs
* **[docker](code/docker)**
* **[kubernetes](code/kubernetes):**
* **[spin up a node server](code/kubernetes/node-server-example)**
* **[kustomize for deployment](code/kubernetes/kustomize)**
* **[python cdk for deployment](code/kubernetes/python-cdk)**
* **[kafka (long polling)](code/kafka)**
<br>
#### cloud
* **[aws](code/aws)**
* **[gcp](code/gcp)**
<br>
#### management
* **[chef](code/chef)**
* **[kafka](code/kafka)**
<br>
#### learning
* **[protocol demos](code/protocol_demos/)**
<br>
---
### external resources
<br>


@ -19,7 +19,7 @@
#### the basic idea
1. client sends a request
- the request structure is defined by both client and server and has a boundary
2. server parses the request
- the parsing cost is not cheap (e.g. `json` vs. `xml` vs. protocol buffers)
- for example, for a large image, chunks can be sent, with a request per chunk
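The request/response steps above can be sketched with Python's stdlib `http.server`; the echo endpoint, port choice, and payload names are illustrative, not part of the original notes:

```python
import json
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.request import Request, urlopen

class EchoHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Content-Length is the boundary agreed on by client and server
        length = int(self.headers["Content-Length"])
        payload = json.loads(self.rfile.read(length))  # parsing cost paid here
        body = json.dumps({"echo": payload}).encode()
        self.send_response(200)
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):  # keep the demo quiet
        pass

server = HTTPServer(("127.0.0.1", 0), EchoHandler)  # port 0: pick a free port
threading.Thread(target=server.serve_forever, daemon=True).start()

# 1. client sends a request; 2. server parses it and responds
req = Request(
    f"http://127.0.0.1:{server.server_port}",
    data=json.dumps({"chunk": 1}).encode(),
    headers={"Content-Type": "application/json"},
)
response = json.loads(urlopen(req).read())
server.shutdown()
```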
@ -117,6 +117,7 @@ curl -v --trace marinasouza.xyz
<br>
#### Async workload is everywhere
- async programming (promises, futures)
- async backend processing
- async commits in postgres
@ -134,25 +135,21 @@ curl -v --trace marinasouza.xyz
#### pros and cons
- real-time
- the client must be online (connected to the server)
- the client must be able to handle the load
- polling is preferred for light clients
- used by RabbitMQ (clients consume the queues, and the messages are pushed to the clients)
<br>
#### the basic idea
1. client connects to a server
2. server sends data to the client
3. client doesn't have to request anything
4. protocol must be bidirectional
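The push steps above can be sketched with raw TCP sockets (the message contents are made up): the client only connects and reads, while the server decides on its own when to send:

```python
import socket
import threading

# push sketch: once the client connects, the server sends data on its
# own schedule; the client never issues a request
server = socket.socket()
server.bind(("127.0.0.1", 0))  # port 0: pick a free port
server.listen(1)

def push_messages():
    conn, _ = server.accept()
    for msg in (b"event-1\n", b"event-2\n"):
        conn.sendall(msg)  # server-initiated: no request needed
    conn.close()

threading.Thread(target=push_messages, daemon=True).start()

client = socket.socket()
client.connect(server.getsockname())
received = client.makefile().read().splitlines()  # blocks until server closes
client.close()
server.close()
```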
<br>
@ -162,11 +159,12 @@ curl -v --trace marinasouza.xyz
<br>
* used when a request takes a long time to process (e.g., uploading a video), and it is very simple to build
* however, it can be too chatty, using too much network bandwidth and backend resources
<br>
#### the basic idea
1. client sends a request
2. server responds immediately with a handle
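Assuming an in-memory job store, the handle-then-poll flow might look like this (all names are illustrative):

```python
import threading
import time
import uuid

jobs = {}  # in-memory job store standing in for the backend

def submit(video):
    handle = str(uuid.uuid4())
    jobs[handle] = {"status": "pending", "result": None}

    def work():
        time.sleep(0.2)  # simulate slow processing (e.g., a video upload)
        jobs[handle] = {"status": "done", "result": f"processed:{video}"}

    threading.Thread(target=work, daemon=True).start()
    return handle  # server responds immediately with a handle

def poll(handle):
    return jobs[handle]["status"]

handle = submit("cat.mp4")
while poll(handle) != "done":  # the chatty part: the client keeps asking
    time.sleep(0.05)
result = jobs[handle]["result"]
```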
@ -189,9 +187,9 @@ curl -v --trace marinasouza.xyz
<br>
#### the basic idea
<br>
1. client sends a request
2. server responds immediately with a handle
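A sketch of a held ("long") poll, assuming a queue-backed backend: instead of replying "nothing yet" right away, the server parks the request until an event arrives or a timeout fires:

```python
import queue
import threading
import time

events = queue.Queue()  # stand-in for the server's pending data

def long_poll(timeout=5.0):
    try:
        return events.get(timeout=timeout)  # blocks: the "held" request
    except queue.Empty:
        return None                         # client would simply retry

def producer():
    time.sleep(0.2)                         # data shows up a bit later
    events.put("new-message")

threading.Thread(target=producer, daemon=True).start()
answer = long_poll()                        # returns once the event lands
```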
@ -209,11 +207,11 @@ curl -v --trace marinasouza.xyz
<br>
* one request with a long response, but the client must be online and be able to handle the response
<br>
#### the basic idea
1. a response has start and end
2. client sends a request
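The single-long-response idea can be sketched as a generator streaming SSE-style framed events; the framing here is simplified, not a full implementation:

```python
# one request, one long response, streamed in framed chunks
def stream_events():
    yield "data: start\n\n"          # the response has a start...
    for i in range(3):
        yield f"data: tick {i}\n\n"  # ...a body streamed over time...
    yield "data: end\n\n"            # ...and an end

# the client reads the single long response incrementally
received = [chunk.strip() for chunk in stream_events()]
```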
@ -229,6 +227,17 @@ curl -v --trace marinasouza.xyz
### Publish Subscribe (Pub/Sub)
<br>
* one publisher has many readers (and there can be many publishers)
* relevant when there are many servers (e.g., upload, compress, format, notification)
* great for microservices as it scales with multiple receivers
* loose coupling (clients are not connected to each other, and the system keeps working while some clients are not running)
* however, you cannot know if the consumer/subscriber got the message or got it twice, etc.
* also, it might result in network saturation and extra complexity
* used by RabbitMQ and Kafka
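A minimal in-process broker (a stand-in for RabbitMQ or Kafka; the class and topic names are made up) shows the loose coupling: publisher and subscribers only share a topic name, never a direct connection:

```python
from collections import defaultdict

class Broker:
    def __init__(self):
        self.topics = defaultdict(list)  # topic name -> subscriber callbacks

    def subscribe(self, topic, callback):
        self.topics[topic].append(callback)

    def publish(self, topic, message):
        for callback in self.topics[topic]:  # fan-out to every subscriber
            callback(message)

broker = Broker()
compressed, notified = [], []
broker.subscribe("video.uploaded", lambda m: compressed.append(f"compress {m}"))
broker.subscribe("video.uploaded", lambda m: notified.append(f"notify {m}"))
broker.publish("video.uploaded", "cat.mp4")  # one publisher, many readers
```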
<br>
@ -236,6 +245,12 @@ curl -v --trace marinasouza.xyz
### Multiplexing vs. Demultiplexing
<br>
* used by HTTP/2, QUIC, connection pool, MPTCP
* connection pooling is a technique where you spin up several backend connections and keep them "hot"
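Connection pooling can be sketched as a queue of pre-opened connections that are borrowed and returned instead of re-created per request; `FakeConnection` is a placeholder for a real backend connection:

```python
import queue

class FakeConnection:  # placeholder for a real backend connection
    def __init__(self, cid):
        self.cid = cid

class Pool:
    def __init__(self, size):
        self.idle = queue.Queue()
        for cid in range(size):
            self.idle.put(FakeConnection(cid))  # pre-warmed, kept "hot"

    def acquire(self):
        return self.idle.get()  # blocks if every connection is busy

    def release(self, conn):
        self.idle.put(conn)     # return it instead of closing it

pool = Pool(size=1)
conn = pool.acquire()
pool.release(conn)
again = pool.acquire()  # no new connection is opened
reused = conn is again
pool.release(again)
```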
<br>
@ -244,11 +259,69 @@ curl -v --trace marinasouza.xyz
### Stateful vs. Stateless
<br>
* a very contentious topic: is state stored in the backend? how do you rely on the state of an application, system, or protocol?
* **stateful backend**: stores state about clients in its memory and depends on that information being there
* **stateless backend**: the client is responsible for "transferring the state" with every request (the backend may store it but can safely lose it)
<br>
#### Stateless backends
* stateless backends can still store data somewhere else
* the backend remains stateless, but the system is stateful (can you restart the backend during idle time while the client workflow continues to work?)
<br>
#### Stateful backend
* the server generates a session, stores it locally, and returns it to the client
* on each request, the server checks that the session is in its memory to authenticate the client and respond
* if the backend is restarted, all sessions are lost (they never relied on a database)
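The session flow above, sketched with an in-memory store (all names are illustrative); note how a simulated restart wipes every session:

```python
import uuid

sessions = {}  # in-memory session store: state lives only in this process

def login(user):
    session_id = str(uuid.uuid4())
    sessions[session_id] = user  # server stores the session locally
    return session_id            # handle returned to the client

def authenticate(session_id):
    return sessions.get(session_id)  # None if unknown (or after a restart)

sid = login("alice")
before_restart = authenticate(sid)
sessions.clear()                 # simulate a backend restart
after_restart = authenticate(sid)
```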
<br>
#### Stateless vs. Stateful protocols
* protocols can be designed to store state
* TCP is stateful: sequence numbers, connection file descriptors
* UDP is stateless: DNS sends a queryID in UDP to identify queries
* QUIC is stateful, but because it sends a connectionID to identify each connection, it transfers state across the protocol
* you can build a stateless protocol on top of a stateful one and vice versa (e.g., HTTP on top of TCP, with cookies)
<br>
#### Complete stateless systems
* stateless systems are very rare
* state is carried with every request
* a backend service that relies completely on the input
* **JWT (JSON Web Token)**: everything is in the token, and you cannot mark it as invalid
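A JWT-style token can be sketched with only the stdlib `hmac` module; real JWTs add a base64 header segment and standard claims, and `SECRET` plus the claim names here are made up. Everything the backend needs is inside the token, and a tampered token simply fails verification:

```python
import base64
import hashlib
import hmac
import json

SECRET = b"demo-secret"  # made-up signing key

def issue(claims):
    # the token carries all the state: payload + signature
    payload = base64.urlsafe_b64encode(json.dumps(claims).encode())
    sig = hmac.new(SECRET, payload, hashlib.sha256).hexdigest().encode()
    return payload + b"." + sig

def verify(token):
    payload, sig = token.rsplit(b".", 1)
    expected = hmac.new(SECRET, payload, hashlib.sha256).hexdigest().encode()
    if not hmac.compare_digest(sig, expected):
        return None  # tampered or forged token
    return json.loads(base64.urlsafe_b64decode(payload))

token = issue({"user": "alice", "role": "admin"})
claims = verify(token)  # stateless check: no session store consulted
flipped = b"0" if token[-1:] != b"0" else b"1"
tampered = verify(token[:-1] + flipped)  # flip one signature character
```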
<br>
---
### Sidecar Pattern
<br>
* every protocol requires a library, but changing the library is hard: the app is entrenched in it, and changes break backward compatibility
* sidecar pattern is the idea of delegating communication through a proxy with a rich library (and the client has a thin library)
* in this case, every client has a sidecar proxy
* pros: it's language agnostic, provides extra security, service discovery, caching.
* cons: complexity, latency
<br>
#### Examples
* service mesh proxies (Linkerd, Istio, Envoy)
* sidecar proxy container (must be layer 7 proxy)
<br>