TY - GEN
T1 - Academic Data Privacy-Preserving using Centralized and Distributed Systems
T2 - 2024 6th International Conference on Big-data Service and Intelligent Computation, BDSIC 2024
AU - Lamaazi, Hanane
AU - Alneyadi, Aysha Saeed Mohammed
AU - Serhani, Mohamed Adel
N1 - Publisher Copyright:
© 2024 Copyright held by the owner/author(s).
PY - 2024/5/29
Y1 - 2024/5/29
N2 - Data privacy has become a critical concern in a set of domains, including healthcare, education, traffic monitoring, etc., due to technology's high deployment and massive data collection. In education, academic institutions have started taking several precautions to prevent data misuse, especially students' information, unauthorized access to the institution's databases, and any security breaches that can negatively affect the institutions' activities and objectives and students' lives. Protecting student information has become a priority, especially with the emergence of online learning, to create a safe environment, foster trust, and comply with relevant laws. Existing data privacy techniques are mostly deployed in centralized platforms, which can increase the data processing complexity and response time. However, the emergence of distributed systems helped to improve the infrastructure's security and users' privacy. Also, it reduced the processing and transmission time while providing high-quality services. This paper proposes a comparative study of deploying distributed and centralized platforms while preserving education data privacy. The distributed system is developed using k-means clustering, while data privacy is ensured by applying the k-anonymity technique using both generalization and suppression. As a result, the centralized system outperforms the distributed one in terms of β-likeliness, t-closeness, and δ-disclosure, with less suppression. Also, centralized platforms require less execution time and higher memory allocation than distributed ones.
AB - Data privacy has become a critical concern in a set of domains, including healthcare, education, traffic monitoring, etc., due to technology's high deployment and massive data collection. In education, academic institutions have started taking several precautions to prevent data misuse, especially students' information, unauthorized access to the institution's databases, and any security breaches that can negatively affect the institutions' activities and objectives and students' lives. Protecting student information has become a priority, especially with the emergence of online learning, to create a safe environment, foster trust, and comply with relevant laws. Existing data privacy techniques are mostly deployed in centralized platforms, which can increase the data processing complexity and response time. However, the emergence of distributed systems helped to improve the infrastructure's security and users' privacy. Also, it reduced the processing and transmission time while providing high-quality services. This paper proposes a comparative study of deploying distributed and centralized platforms while preserving education data privacy. The distributed system is developed using k-means clustering, while data privacy is ensured by applying the k-anonymity technique using both generalization and suppression. As a result, the centralized system outperforms the distributed one in terms of β-likeliness, t-closeness, and δ-disclosure, with less suppression. Also, centralized platforms require less execution time and higher memory allocation than distributed ones.
KW - Additional Keywords and Phrases Education
KW - anonymity
KW - centralized systems
KW - data privacy
KW - distributed systems
UR - http://www.scopus.com/inward/record.url?scp=85205445374&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85205445374&partnerID=8YFLogxK
U2 - 10.1145/3686540.3686542
DO - 10.1145/3686540.3686542
M3 - Conference contribution
AN - SCOPUS:85205445374
T3 - ACM International Conference Proceeding Series
SP - 8
EP - 16
BT - BDSIC 2024 - 2024 6th International Conference on Big-data Service and Intelligent Computation
PB - Association for Computing Machinery
Y2 - 29 May 2024 through 31 May 2024
ER -