What is the difference between lossless and lossy compression techniques?

  • Discards some data to achieve higher compression
  • Only applicable to text data
  • Preserves all original data during compression
  • Reduces the size of data without any impact
Lossless compression preserves all original data during the compression process. It is commonly used for text and data where no loss of information is acceptable. In contrast, lossy compression sacrifices some data to achieve higher compression ratios, often used in multimedia applications where slight quality loss is acceptable.

What is the primary data structure used in graph databases?

  • Graph
  • List
  • Table
  • Tree
The primary data structure used in graph databases is a graph. Unlike traditional relational databases that use tables, a graph database employs a graph structure with nodes and edges to represent and store data. This allows for efficient representation of relationships between entities.

An entity that inherits properties and relationships from another entity is known as a _______.

  • Child entity
  • Derived entity
  • Linked entity
  • Parent entity
An entity that inherits properties and relationships from another entity is known as a "Child entity" or "Subtype." It represents a more specialized category within a Generalization and Specialization hierarchy.

In which normal form are all non-prime attributes fully functionally dependent on the primary key?

  • Boyce-Codd Normal Form (BCNF)
  • First Normal Form (1NF)
  • Second Normal Form (2NF)
  • Third Normal Form (3NF)
All non-prime attributes being fully functionally dependent on the primary key characterizes a table in Boyce-Codd Normal Form (BCNF). BCNF is a higher level of normalization that ensures the removal of certain types of anomalies and dependencies in the database.

_______ is a storage optimization strategy that involves storing frequently accessed data in memory for faster access.

  • Caching
  • Data Warehousing
  • Indexing
  • Sharding
Caching is a storage optimization strategy that involves storing frequently accessed data in memory. This helps in reducing the time it takes to retrieve the data since accessing data from memory is faster compared to fetching it from disk. Caching is commonly used in database systems to improve overall performance.

How does clustering differ from partitioning in database management?

  • Clustering enhances data security, while partitioning improves data distribution
  • Clustering focuses on optimizing data access patterns, while partitioning emphasizes data replication
  • Clustering involves grouping related data together on the same node, while partitioning involves dividing a database into smaller, independent parts
  • Clustering reduces data redundancy, while partitioning increases fault tolerance
Clustering involves grouping related data together on the same node or server to improve data locality and reduce network overhead. On the other hand, partitioning involves dividing a database into smaller, independent parts to distribute data across multiple nodes or servers, improving scalability and performance.

Which type of diagrams can be created using ER diagram tools?

  • Entity-Relationship Diagrams
  • Flowcharts
  • Mind maps
  • UML diagrams
ER diagram tools specialize in creating Entity-Relationship Diagrams (ERDs). These diagrams depict the relationships between entities in a database, helping users understand the data structure and connections within a system.

What is the primary purpose of using UML in data modeling?

  • To create user interfaces
  • To define programming logic
  • To implement database queries
  • To visualize and design software systems
The primary purpose of using UML (Unified Modeling Language) in data modeling is to visualize and design software systems. UML provides a standardized way to represent and communicate the structure, behavior, and architecture of software systems, including aspects of data modeling such as classes, objects, and their relationships.

Clustering helps to group similar data _______ for efficient retrieval.

  • Based on types
  • Geographically
  • Proactively
  • Together
Clustering helps to group similar data together for efficient retrieval. This technique organizes data based on similarities, making it easier to locate and retrieve relevant information during queries. It is particularly useful for optimizing search and retrieval operations in databases.

Which of the following statements best describes denormalization?

  • It increases the complexity of queries
  • It is only applicable to small databases
  • It optimizes for write operations
  • It reduces redundancy in the database
Denormalization increases the complexity of queries by introducing redundancy. It's a deliberate design choice to improve read performance at the cost of some additional storage and potential data inconsistency.