An architecture that tracks data distribution and data generation, and calculates the impact of data amplification.