CODASPY 2017 Panel

Trustworthy Data Science

Thursday, March 23, 1:30 PM - 2:45 PM


Elisa Bertino, Lorenzo Cavallaro, Alexandros Kapravelos, and Adam Lee


Adam Doupé


Much of the research that our community publishes is based on data, which we call broadly as data science. However, an open question remains: are the results of data science trustworthy, and how can we increase our trust in data science? Accomplishing this goal is difficult, as we must trust the inputs, systems, and results of data science. This panel will discuss the current state of trustworthy data science, and explore possible technical, legal, and cultural solutions that can increase our trust in the input, systems, and results of data science.

Some broad questions that the panel will consider:

- How does the community view research that replicates other studies? Should we, as a community, encourage reproducible research and replication studies? If so, how can we actually achieve this goal?

- What prevents researchers from sharing data sets? What could be done to alleviate these problems? Are there technical research solutions that could prove useful?

- With many of our computing systems relying on machine learning algorithms, how much trust can we place in these systems? Can there be a trustworthy machine learning future, and if so how can we create that future?

- How does one know that a dataset is representative and unbiased? Are there techniques in collecting data that can improve the representativeness of the underlying population?

- How can we trust the output of research that is performed on  datasets that the researchers are under legal obligations not to share (due to NDAs, privacy issues, etc.)?

Elisa Bertino, Professor, Purdue University


Elisa Bertino joined Purdue in January 2004 as Professor in Computer Science and research director at CERIAS. Her research interests cover many areas in the fields of information security and database systems. Her research combines both theoretical and practical aspects, addressing applications on a number of domains, such as medicine and humanities. Current research includes: access control systems, secure publishing techniques and secure broadcast for XML data; advanced RBAC models and foundations of access control models; trust negotiation languages and privacy; data mining and security; multi-strategy filtering systems for Web pages and sites; security for grid computing systems; integration of virtual reality techniques and databases; and geographical information systems and spatial databases.

Professor Bertino serves or has served on the editorial boards of several journals - many of which are related to security, such as the ACM Transactions on Information and System Security, the IEEE Security & Privacy Magazine, and IEEE Transactions on Dependable and Secure Computing. She is currently serving as program chair of the 36th International Conference on Very Large Data Bases (VLDB 2010). Professor Bertino is a Fellow of the Institute of Electrical and Electronics Engineers and a Fellow of ACM. She received the IEEE Computer Society Technical Achievement award in 2002 for outstanding contributions to database systems and database security and advanced data management systems, and received the 2005 Tsutomu Kanai Award by the IEEE Computer Society for pioneering and innovative research contributions to secure distributed systems.

Lorenzo Cavallaro, Reader (Associate Professor), Royal Holloway, University of London


Lorenzo Cavallaro is a Reader (Associate Professor) of Information Security in the School of Mathematics and Information Security at Royal Holloway, University of London. His research focuses largely on systems security and he has founded and is leading the recently-established Systems Security Research Lab (S2Lab), which builds on program analysis and machine learning to devise novel techniques to protect systems from a broad range of threats, including those perpetrated by malicious software. Prior to joining Royal Holloway, Lorenzo was a Post Doctorate researcher in the Systems & Security group at Vrije Universiteit Amsterdam, the Security Group at UC Santa Barbara and a long-term Visiting Scholar at Stony Brook University. He publishes and sits on program committees of well-known and premiere security conferences, and his research is funded by the UK EPSRC, McAfee, and Royal Holloway. He teaches Software Security at graduate level within the MSc in Information Security and Malicious Software at undergraduate level within the BSc in Computer Science with Information Security, and has been delivering a MOOC on Malicious Software and its Underground Economy since 2013. Such classes align perfectly with his research and help shape the next generation of professionals and scientists.

Alexandros Kapravelos, Assistant Professor, North Carolina State University


Alexandros Kapravelos is an Assistant Professor in the Department of Computer Science at North Carolina State University. His research interests lie in the area of computer security and he is particularly interested in browser security and building systems that solve security problems. In the past, he was the lead developer of Wepawet, a system that detects drive-by downloads with the use of an emulated browser, Hulk, the first system that automatically identifies malicious browser extensions, and Revolver, a system that detects evasive drive-by download attempts. He is currently interested in internet-wide attacks, building scalable systems to protect users and improving privacy on the web.

Adam J. Lee, Associate Professor, University of Pittsburgh


Adam J. Lee is currently an Associate Professor of Computer Science at the University of Pittsburgh, where he previously held the position of Assistant Professor (2008-2014). Prior to joining the University of Pittsburgh, he received the MS (2005) and PhD (2008) degrees in Computer Science from the University of Illinois at Urbana-Champaign, and received his BS in Computer Science from Cornell University (2003). His research interests lie at the intersection of the computer security, privacy, and distributed systems fields.  Dr. Lee's research has been supported by the NSF and DARPA, and he is an NSF CAREER award recipient. For more information, please see