Tracking the Leaker: An Encodable Watermarking Method for Dataset Intellectual Property Protection

No Thumbnail Available
File version
Author(s)
Shang, Yifan
Xue, Mingfu
Zhang, Leo Yu
Zhang, Yushu
Liu, Weiqiang
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
2024
Size
File type(s)
Location

Changsha, China

License
Abstract

Presently, numerous enterprises provide machine learning cloud services. However, the service provider may exploit user-uploaded data for unauthorized model retraining or illicit collection of user data for commercial model development. This study introduces a traceable dataset watermarking technique designed to ascertain the trustworthiness of third-party providers offering machine learning cloud services. In the event of a data breach, the source can be traced back to the suspicious third-party responsible for data leakage. Specifically, we propose a method that employs the clean-label backdoor attack framework to infer whether a third-party model is trained using user data. A watermark, associated with the encoding and designed as a trigger, is injected into the dataset through a trained autoencoder. Experimental evaluation on three datasets proves the effectiveness of the proposed method, yielding over 93% accuracy on average under normal conditions. A series of pruning and fine-tuning attacks were carried out on the method, with the results indicating that these attacks have a minimal impact and confirming the method’s robustness.

Journal Title
Conference Title

ACM-TURC '24: Proceedings of the ACM Turing Award Celebration Conference - China 2024

Book Title
Edition
Volume
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
Item Access Status
Note
Access the data
Related item(s)
Subject
Persistent link to this record
Citation

Shang, Y; Xue, M; Zhang, LY; Zhang, Y; Liu, W, Tracking the Leaker: An Encodable Watermarking Method for Dataset Intellectual Property Protection, ACM-TURC '24: Proceedings of the ACM Turing Award Celebration Conference - China 2024, 2024, pp. 114-119