Scenario: A company has employees and projects. An employee can work on multiple projects, and a project can have multiple employees working on it. Which type of relationship would you use to represent this scenario in an ERD?

  • Many-to-Many
  • Many-to-One
  • One-to-Many
  • One-to-One
In this scenario, a Many-to-Many relationship is appropriate. This allows each employee to work on multiple projects, and each project to have multiple employees associated with it. It involves a junction table to manage the relationship.

In an ERD, an _______ key is a key that consists of more than one attribute.

  • Composite
  • Foreign
  • Primary
  • Unique
In an ERD, a composite key is a key that consists of more than one attribute. It is used to uniquely identify an entity when a single attribute is not sufficient. Composite keys are common in situations where a combination of attributes is needed for uniqueness.

What is aggregation in the context of data modeling?

  • Breaking down a large dataset into smaller pieces
  • Combining multiple entities into a single entity
  • Creating relationships between entities
  • Summarizing and combining data at a higher level
Aggregation in data modeling involves summarizing and combining data at a higher level. It is used to provide insights into the overall trends and patterns within a dataset, often involving the use of aggregation functions like SUM, AVG, COUNT, etc., to analyze and present data in a more meaningful way.

What is the central table in a Star Schema?

  • Dimension Table
  • Fact Table
  • Lookup Table
  • Reference Table
In a Star Schema, the central table is the Fact Table. This table contains the core data of the schema and is usually surrounded by Dimension Tables that provide context and additional details for the data in the Fact Table. The Fact Table typically contains numerical measures or facts that business users want to analyze.

How does indexing improve query performance in a database?

  • Decreases the overall database size
  • Increases the complexity of queries
  • Reduces the number of records in a table
  • Speeds up data retrieval by providing a quick access path
Indexing improves query performance by creating a quick access path to the data. It reduces the number of records that need to be scanned, making data retrieval faster for specific queries.

What is a transitive dependency in the context of normalization?

  • Dependency between non-prime attributes
  • Dependency where a non-prime attribute determines a prime attribute
  • Dependency where a non-prime attribute determines another non-prime attribute
  • Dependency where a prime attribute determines another prime attribute
In normalization, a transitive dependency occurs when a non-prime attribute determines another non-prime attribute. It violates the principles of normalization, and resolving such dependencies is crucial for achieving higher normal forms.

How does SQL handle data manipulation compared to UML?

  • SQL focuses on the structure of classes and objects
  • SQL is specific to NoSQL databases
  • UML is a visual representation language, whereas SQL is text-based for database manipulation
  • UML is more efficient in handling complex queries
SQL and UML serve different purposes in data modeling. SQL is a text-based language primarily used for querying and manipulating databases, while UML is a visual modeling language. SQL focuses on the specifics of database operations, whereas UML provides a broader visual representation of system structure and behavior.

What is the primary goal of clustering in database management?

  • To group similar data together
  • To improve database backups
  • To increase database security
  • To reduce database size
The primary goal of clustering in database management is to group similar data together. By organizing similar data into clusters, it becomes easier to retrieve relevant information and perform data analysis tasks. Clustering can also improve query performance and data organization in the database.

The _______ constraint allows you to define a condition that must be met for the data to be valid.

  • Check
  • Integrity
  • Referential
  • Validation
The Check constraint in a database allows you to define a condition or expression that must be satisfied for the data to be considered valid. It is used to ensure that data adheres to specific criteria, providing data integrity at the column level.

How do NoSQL databases handle consistency in distributed systems compared to traditional relational databases?

  • Emphasizing centralized control
  • Relying on eventual consistency
  • Using ACID properties
  • Utilizing distributed transactions
NoSQL databases often rely on eventual consistency in distributed systems compared to traditional relational databases. Unlike traditional databases that emphasize strong consistency through distributed transactions and ACID properties, NoSQL databases prioritize low-latency operations and high availability, accepting temporary inconsistencies that will eventually be resolved.