In complex ETL processes, ________ is used for managing dependencies and workflow orchestration.

  • Apache Airflow
  • Informatica
  • Power BI
  • Tableau
Apache Airflow is a popular open-source platform used in complex ETL processes for managing dependencies and orchestrating workflows. It allows the design and scheduling of workflows as directed acyclic graphs (DAGs), providing a flexible and scalable solution for ETL pipeline management.
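
The dependency ordering that a DAG provides can be sketched in plain Python. This is not Airflow's API: it is a minimal illustration using the standard library's `graphlib` module, and the task names and the `run_order` helper are hypothetical.

```python
from graphlib import TopologicalSorter

# Hypothetical ETL tasks: each key maps to the set of tasks it depends on.
dependencies = {
    "extract_orders": set(),
    "extract_customers": set(),
    "transform_join": {"extract_orders", "extract_customers"},
    "load_warehouse": {"transform_join"},
}


def run_order(deps):
    """Return one valid execution order for the task graph.

    TopologicalSorter raises graphlib.CycleError if the graph has a cycle,
    which is why a workflow has to be acyclic to be schedulable.
    """
    return list(TopologicalSorter(deps).static_order())


order = run_order(dependencies)
print(order)
```

Both extract tasks come before `transform_join`, and `load_warehouse` runs last. An orchestrator such as Airflow layers scheduling, retries, and monitoring on top of this kind of ordering.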

What is a stored procedure in a DBMS and when is it used?

  • A procedure that is stored in a file system.
  • A stored procedure is a precompiled collection of one or more SQL statements that can be executed as a single unit.
  • A type of index in a database.
  • It is a virtual table used for optimizing query performance.
Stored procedures are used to encapsulate a series of SQL statements for execution as a single unit. They enhance code modularity, security, and performance by reducing the need to send multiple queries to the database server.

In a case study where a company is facing declining sales, what analysis technique would be most effective in identifying the root causes?

  • Cohort Analysis
  • Regression Analysis
  • SWOT Analysis
  • Trend Analysis
Regression Analysis would be most effective in identifying the root causes of declining sales. It allows for the exploration of relationships between variables and can help uncover factors contributing to the decline. SWOT Analysis, Trend Analysis, and Cohort Analysis focus on different aspects and may not provide the same depth of insight.
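
As a rough sketch of the idea, a simple least-squares regression of sales on one candidate factor can be computed by hand. The numbers below are made up purely for illustration, and `fit_line` is a hypothetical helper, not part of any library.

```python
# Illustrative, made-up data: unit price per month and units sold that month.
price = [10, 11, 12, 13, 14]
units_sold = [200, 190, 178, 171, 160]


def fit_line(x, y):
    """Ordinary least squares for y = intercept + slope * x (one predictor)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-deviations and sum of squared deviations of x.
    s_xy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    s_xx = sum((xi - mean_x) ** 2 for xi in x)
    slope = s_xy / s_xx
    intercept = mean_y - slope * mean_x
    return slope, intercept


slope, intercept = fit_line(price, units_sold)
print(f"slope={slope:.2f}, intercept={intercept:.2f}")
```

A negative slope suggests that higher prices are associated with lower sales in this toy data. A real analysis would include several predictors and would treat the fitted relationship as an association to investigate, not as proof of cause.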

To ensure clarity in communication, a data analyst should avoid using too much _______.

  • Ambiguity
  • Complexity
  • Jargon
  • Redundancy
A data analyst should avoid using too much complexity in their communication to ensure clarity. This includes avoiding unnecessary technical jargon, which may hinder understanding for non-technical stakeholders.

How does Git's distributed version control system differ from centralized systems?

  • Centralized systems have better performance than Git.
  • Each developer has a local repository, allowing for offline work and faster operations.
  • Git does not support branching and merging.
  • Git requires a constant connection to the central server for all operations.
In a distributed version control system like Git, each developer has a local repository, enabling them to work offline and perform operations more quickly. This decentralization is a key differentiator from centralized systems, which rely on a central server for version control.

How do you select distinct values from a column in a SQL table?

  • DIFFERENT
  • DISTINCT
  • SELECT UNIQUE
  • UNIQUE
The DISTINCT keyword, placed directly after SELECT, removes duplicate rows from the result set, so selecting a single column returns each of its values only once. When several columns are selected, DISTINCT applies to the combination of their values rather than to each column separately.
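
A minimal sketch of DISTINCT in action, using Python's built-in `sqlite3` module with an in-memory database. The `orders` table and its contents are hypothetical.

```python
import sqlite3

# In-memory database with a hypothetical table containing duplicate values.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, country TEXT)")
conn.executemany(
    "INSERT INTO orders (country) VALUES (?)",
    [("US",), ("DE",), ("US",), ("FR",), ("DE",)],
)

# DISTINCT collapses the five rows into one row per unique country.
rows = conn.execute("SELECT DISTINCT country FROM orders ORDER BY country")
distinct_countries = [row[0] for row in rows]
print(distinct_countries)  # ['DE', 'FR', 'US']

conn.close()
```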

The process of _________ involves identifying, understanding, and documenting how data moves across an organization.

  • Data Analysis
  • Data Extraction
  • Data Mapping
  • Data Validation
The process of Data Mapping involves identifying, understanding, and documenting how data moves across an organization. It helps in understanding data flows and relationships between different systems.

Which SQL clause is used to filter records based on more than one condition?

  • AND
  • FILTER
  • OR
  • WHERE
The OR operator is used inside a WHERE clause to filter records on more than one condition, returning rows that satisfy at least one of the specified conditions. AND also combines conditions but requires all of them to hold, so it narrows the filter where OR broadens it. WHERE is the clause that carries these conditions, whether there is one or many.
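
The difference between combining conditions with AND and with OR can be sketched with Python's built-in `sqlite3` module. The `employees` table and its rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (name TEXT, dept TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO employees VALUES (?, ?, ?)",
    [
        ("Ana", "Sales", 50000),
        ("Ben", "Sales", 70000),
        ("Cho", "IT", 70000),
        ("Dev", "IT", 40000),
    ],
)

# AND: a row must satisfy both conditions.
and_rows = [
    r[0]
    for r in conn.execute(
        "SELECT name FROM employees "
        "WHERE dept = 'Sales' AND salary > 60000 ORDER BY name"
    )
]

# OR: a row must satisfy at least one condition.
or_rows = [
    r[0]
    for r in conn.execute(
        "SELECT name FROM employees "
        "WHERE dept = 'Sales' OR salary > 60000 ORDER BY name"
    )
]

print(and_rows)  # ['Ben']
print(or_rows)   # ['Ana', 'Ben', 'Cho']

conn.close()
```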

In project management, which KPI is crucial for evaluating the efficiency of resource utilization?

  • Earned Value (EV)
  • Resource Utilization Rate
  • Return on Investment (ROI)
  • Schedule Variance (SV)
Resource Utilization Rate is crucial for evaluating the efficiency of resource utilization in project management. It measures the percentage of time that resources are actively contributing to project work. ROI, EV, and SV are important KPIs but focus on different aspects of project performance.

When executing a, b = 10, 20; print(____) to print a and b in swapped order, the output is 20 10.

  • a
  • a, b
  • b
  • b, a
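The blank is b, a. Calling print(b, a) prints the two values in swapped order, so the output is 20 10 (print separates its arguments with a space, not a comma). This does not change the variables themselves; to actually exchange their values, use the tuple assignment a, b = b, a.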
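
The difference between printing in swapped order and actually swapping the variables can be checked directly:

```python
a, b = 10, 20

# Printing in swapped order leaves the variables untouched.
print(b, a)  # 20 10
unchanged = (a, b)  # still (10, 20)

# Tuple assignment exchanges the values bound to a and b.
a, b = b, a
print(a, b)  # 20 10
```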