Big Data: Preserving Privacy by Design

November 26, 2013
Rae Lynn Mitchell
Public Health

Information systems in the health sector have changed significantly, making it possible to collect, store, and process huge amounts of digital records that may hold the key to future population health breakthroughs. However, linking records across diverse data systems is complicated by privacy issues and by data that are recorded differently (10/12/58 or Oct. 12, 1958), change over time (maiden vs. married last names), contain errors (transposed dates during data entry), or are simply missing. So how do we use this new wealth of data, commonly termed “big data,” in a way that both maintains privacy and ensures accuracy?
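To make the formatting problem concrete, here is a minimal Python sketch of reconciling the two date styles mentioned above. It is an illustration only, not part of Kum's framework: the function name, the list of formats, and the cutoff year for two-digit years are all assumptions, and month abbreviations are parsed according to the English locale.

```python
from datetime import datetime

# Hypothetical format list: (strptime pattern, whether the year has two digits).
# A real linkage system would need to handle many more variants.
KNOWN_FORMATS = [
    ("%m/%d/%y", True),     # e.g. 10/12/58
    ("%b. %d, %Y", False),  # e.g. Oct. 12, 1958
    ("%Y-%m-%d", False),    # e.g. 1958-10-12
]


def normalize_birth_date(raw, latest_year=2013):
    """Return the date as an ISO string, or None if missing or unparseable."""
    if raw is None or not raw.strip():
        return None
    text = raw.strip()
    for pattern, two_digit_year in KNOWN_FORMATS:
        try:
            parsed = datetime.strptime(text, pattern)
        except ValueError:
            continue  # try the next known format
        # strptime maps two-digit years 00-68 to 2000-2068; a birth date
        # cannot lie after latest_year (an assumed cutoff), so shift it
        # back one century.
        if two_digit_year and parsed.year > latest_year:
            parsed = parsed.replace(year=parsed.year - 100)
        return parsed.date().isoformat()
    return None


print(normalize_birth_date("10/12/58"))
print(normalize_birth_date("Oct. 12, 1958"))
```

Both calls yield the same normalized value, so the two records can be compared on birth date. Values that match no known format come back as None rather than being guessed.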

Hye-Chung Kum, Ph.D., associate professor at the Texas A&M Health Science Center School of Public Health, thinks the answer lies in developing new methodologies for extracting information and outlines her framework in the Journal of the American Medical Informatics Association.

In “Privacy Preserving Interactive Record Linkage (PPIRL),” Kum emphasizes that it is critical to understand the distinction between identity disclosure (e.g., who the person is) and sensitive attribute disclosure (e.g., whether this person has cancer). She maintains that identity disclosure on its own has little potential for harm; it is sensitive attribute disclosure that results in harm.

“Privacy preserving data integration is key to any data intensive population research,” states Kum. “If we define the privacy goal of privacy preserving record linkage as a guarantee against attribute disclosure, we can develop systems that allow both privacy protection and high quality record linkage.”

Human-based third-party linkage centers have been the accepted norm internationally to date. However, Kum believes a more flexible computerized third-party linkage platform, Secure Decoupled Linkage (SDLink), should be considered. SDLink is based on three core privacy principles. First, it separates the identifying data from the sensitive data using encryption. Second, through chaffing (adding fake data) and changing the label of the data set, it prevents the attribute inference that can occur through group disclosure. For example, if someone you know is on a cancer registry (group disclosure), you can infer that they have cancer. However, this attribute disclosure can be eliminated if you know that the list contains fake data (people who do not have cancer are also on the list) or if you do not know that the list is a cancer registry. Finally, identity disclosure is minimized by recoding variables (e.g., gender is recoded as different, same, or missing rather than male or female). As a result, only the information that is essential for record linkage is revealed.
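The recoding and chaffing ideas described above can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the SDLink implementation: the function names, the record layout, and the way fake entries are mixed in are all hypothetical, and the encryption-based decoupling step is omitted.

```python
import random


def recode_field(value_a, value_b):
    """Reveal only whether two values agree, never the values themselves."""
    if value_a in (None, "") or value_b in (None, ""):
        return "missing"
    return "same" if value_a == value_b else "different"


def recode_pair(record_a, record_b, fields):
    """Recode every compared field of a candidate pair of records."""
    return {
        field: recode_field(record_a.get(field), record_b.get(field))
        for field in fields
    }


def add_chaff(real_entries, fake_entries, rng):
    """Mix fake entries into a list so that appearing on the list no
    longer implies the sensitive attribute (hypothetical helper)."""
    mixed = list(real_entries) + list(fake_entries)
    rng.shuffle(mixed)
    return mixed


# Two candidate records that may refer to the same person.
record_a = {"gender": "F", "birth_year": "1958", "last_name": "Smith"}
record_b = {"gender": "F", "birth_year": None, "last_name": "Jones"}

print(recode_pair(record_a, record_b, ["gender", "birth_year", "last_name"]))
print(add_chaff(["id-01", "id-02"], ["fake-01", "fake-02"], random.Random(0)))
```

A person reviewing the recoded pair sees that gender matches, birth year is missing, and last name differs, but learns neither the gender nor the names. The shuffled list shows the chaffing idea: a viewer cannot tell from membership alone which entries are real.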

“Never before in history have we had more data to use for population research,” says Kum. “But to use big data appropriately, we must understand the 4V’s – lots of (volume), complex (variety), data that are continuously generated (velocity), and also have much uncertainty (veracity).”

Kum believes variety and veracity are the two main challenges for big data in health services research, and the key is “to understand the minimum information required for accurate linkage and then to design protocols to reveal, in a secure manner, only that information.”

Media contact: media@tamu.edu


A news publication of Texas A&M Health, Vital Record offers insight on the latest in health, medicine and scientific discovery from experts across our five schools and numerous centers and institutes.
