Data deduplication has been widely adopted by cloud storage providers to reduce operating costs, including the costs of storage, bandwidth, and management. Unfortunately, data deduplication itself introduces new security and privacy threats. This article first describes these threats and then presents an overview of state-of-the-art cloud data deduplication techniques that preserve security and privacy.