T4Tutorials .PK

VU Past Papers CS614 – Data Warehousing Solved Subjective (Spring/Fall 2011 & 2012)


1. Aggregates Awareness


2. One-to-One vs One-to-Many Transformation


3. Cube Creation in ROLAP


4. Timestamps


5. Factors Behind Poor Data Quality (5 marks)

Types of problems: dummy values, missing data, multipurpose fields, cryptic/contradicting data, reused keys, non-unique identifiers.


6. Denormalization Technique


7. Data Granularity


8. Summarization During Transformation


9. Clustering & Associative Rules


10. Splitting Single Field


11. Additive vs Non-Additive Facts (5 marks)


12. Expression Partitioning


13. HOLAP Features


14. Data Validation & Profiling


15. Full vs Incremental Extraction


16. Merge/Purge Problem


17. Data Quality Definitions


18. Erroneous Data


19. Aggregation


20. Fact Table


21. Active DWH


22. Lexical Errors


23. ELT


24. Physical Extraction


25. Data Cleansing Steps

  1. Elementizing
  2. Standardizing
  3. Verifying
  4. Matching
  5. Householding
  6. Documenting

26. Cube Partitioning


27. OLAP Types


28. Data Duplication in Source

MIDTERM Spring 2011 – CS614

1. Effects of source data duplication on analysis in a DW (5 marks)

2. Features of Dimensional Modeling (5 marks)

3. Benefits of CDC in modern systems (3 marks)

4. Statistical analyzer: distributive, algebraic, holistic transformations (3 marks)

5. Round robin distribution is pre-defined? (2 marks)

6. MOLAP vs DOLAP (2 marks)

7. Offline vs Online extraction (2 marks)

8. Difference between ER and DM (5 marks)

9. Orr’s Laws of Data Quality (5 marks)


21. Snowflake Schema (2 marks)

22. Why aggregation & summarization are required? (2 marks)

23. Condition for smart tools to construct less detailed aggregates (3 marks)

24. Web scraping (3 marks)

25. Data loading strategies (5 marks)

26. Drawbacks of MOLAP & curse of dimensionality (5 marks)


21. MOLAP features (2 marks)

22. Steps in cleansing data (Marks missing)

23. Features of Star Schema (3 marks)

24. Multi-pass BSN approach (3 marks)

25. Benefits of HOLAP & DOLAP over MOLAP & ROLAP (5 marks)

26. Relationship between data quality & application value (5 marks)

Exit mobile version