definition | Collection of generally unrefined (raw) or pre-processed data captured and consolidated from multiple sources. It contains data with limited structure, and data models are applied a posteriori, that is, after the data are loaded into the lake. This approach is also known as data-first or schema-on-read. Data in the lake might not be harmonised or integrated, and curation and quality assurance are not required. | ||||||
---|---|---|---|---|---|---|---|
editorial note | Expert review decision, 2021-22: Add | ||||||
type | | ||||||
in scheme | | ||||||
top concept of | rdmt original |
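
The schema-on-read idea in the definition can be illustrated with a minimal sketch. The record layout, field names, and the `read_with_schema` helper below are hypothetical and chosen only for illustration; they are not part of the vocabulary entry. Raw records are kept exactly as they arrived, and a data model is applied only when the data are read.

```python
import json

# Hypothetical raw records as they might land in a lake: stored as-is,
# with inconsistent fields and types, and no validation at load time.
RAW_LINES = [
    '{"id": "1", "temp_c": "21.5", "site": "A"}',
    '{"id": 2, "temp_c": 19}',
    '{"id": "3", "temperature": "n/a", "site": "B"}',
]

# Hypothetical schema, chosen by the reader of the data, not by the writer.
SCHEMA = {"id": int, "temp_c": float}


def read_with_schema(lines, schema):
    """Parse raw JSON lines and coerce each record to `schema` at read time.

    Fields that are missing or cannot be coerced become None, since data
    in the lake is not guaranteed to be curated or quality-checked.
    """
    rows = []
    for line in lines:
        record = json.loads(line)
        row = {}
        for field, cast in schema.items():
            try:
                row[field] = cast(record[field])
            except (KeyError, ValueError, TypeError):
                row[field] = None
        rows.append(row)
    return rows


rows = read_with_schema(RAW_LINES, SCHEMA)
```

A different consumer could read the same raw lines with a different schema (for example one that also includes `site`), which is the practical consequence of applying the data model a posteriori.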