Non functional requirements

1. Security and compliance

For private document, only users with permission should be able access the document.
Is expected to have in flight and at rest encryption for documents.
Is expected to have in flight encryption for operations.
Is expected to implement firewall rules.
Is expected to avoid most common attacks (DDoS, XSS, CSRF and SQL Injection).
Is expected to provide authentication and authorization.
Should implement local throttling or debounce when sending operations to sync service to avoid DDoS.
All document access and modifications should be logged with user identity, timestamp, and operation type.
Audit logs should be retained according to retention policies.

System should provide continuous document editing for users offline and availability requirements focus on synchronization and conflict resolution when reconnecting.
Availability requirements focus on backend services for real-time synchronization and collaboration, rather than local offline editing.
Multi-region replication with automatic conflict resolution ensures consistency when offline changes are merged.

Should aim to prevent loss of collaborative operations, using snapshots and replication strategies.

Is expected to provide offline support to collaborations even without internet connectivity.
Is expected to use LSEQ to handle sequential and ordered.
LSEQ algorithm should be a great solution to ensure sequential, ordered and idempotent operations while optimizing the data structure to support bigger documents.

Is expected to use an implementation on the synchronization service (for conflicts merge) with NodeJs streams to reduce memory overhead in each instance.
Is expected to use Kafka persisted streams to keep track of checkpoints to avoid data loss in case of any failure or interruption during stream processing.

System should support horizontal scaling across multiple instances to handle peak loads.
Is expected to use autoscaling strategy.
Is expected to use cluster mode with NodeJs to use all available cores of each instance CPU.

Is expected to provide local operations with sub-50ms response times with instanteneous renderizations.
Should ensure online syncrhonization with a p99 of sub-200ms response times for collaborative operations.
Is expected to provide Content Delivery Network (CDN) to optimize to edge users.
Is expected to provide edge caching.
Is expected a low latency for global users with multi-region strategy.

Should maintain data integrity across distributed components.
Is expected to have an eventual consistency across multi-region databases and services.

Should provide logs, distributed traces, metrics, alarms and dashboards to monitor and provide support for incidents.