Scenario: A telecommunications company is experiencing challenges with storing and processing large volumes of streaming data from network devices. As a data engineer, how would you design a scalable and fault-tolerant storage architecture to address these challenges?

  • Amazon Redshift
  • Apache HBase + Apache Spark Streaming
  • Apache Kafka + Apache Cassandra
  • Google BigQuery
To address the challenges faced by the telecommunications company, I would design a scalable and fault-tolerant storage architecture using Apache Kafka for real-time data ingestion and Apache Cassandra for distributed storage. Apache Kafka would handle streaming data ingestion from network devices, ensuring data durability and fault tolerance with its replication mechanisms. Apache Cassandra, being a distributed NoSQL database, offers linear scalability and fault tolerance, making it suitable for storing large volumes of streaming data with high availability. This architecture provides a robust solution for storing and processing streaming data in a telecommunications environment.
Add your answer
Loading...

Leave a comment

Your email address will not be published. Required fields are marked *