hiexam
databricks · Certified-Data-Engineer-Associate · Q427 · multiple_choice · topic_1

A data engineer has realized that the data files associated with a Delta table are incredibly small. They want to compa…

A data engineer has realized that the data files associated with a Delta table are incredibly small. They want to compact the small files to form larger files to improve performance. Which keyword can be used to compact the small files?
  • A.OPTIMIZE
  • B.VACUUM
  • C.COMPACTION
  • D.REPARTITION
Explanation
The OPTIMIZE command is used to compact small files into larger ones, which helps improve the performance of Delta Lake tables. It consolidates small files into fewer larger files to reduce the overhead associated with having many small files. This process is often referred to as "compaction" but the specific keyword in Databricks Delta Lake is OPTIMIZE.

Reference: examtopics_top_comment

Practice with progress tracking

Sign in to track wrong answers, get spaced-repetition reminders, and run timed exam mode.