SATURDAY, October 5th
SINGLE TRACK
Welcoming Remarks
Richard De Veaux, Williams College
Lisa Singh, Georgetown University
PANEL: Getting Students Engaged in Social GoodRayid Ghani, Carnegie Mellon University
Max Richman, DataKind
Sarah Stone, University of Washington
Karthikeyan Umapathy, University of North Florida
Moderator: Pam Davis-Kean, University of Michigan
NOTE:
Societal Scale Impact Day will take place in the Governor’s Boardroom
SUNDAY, October 6th
Conference Welcome [BLUE]
Richard De Veaux, Williams College
Lisa Singh, Georgetown University
George Karypis, University of Minnesota
Jennifer Hill, New York University
MONDAY, October 7th
Registration & Continental Breakfast [BLUE PREFUNCTION]
TUESDAY, October 8th
Registration & Continental Breakfast [BLUE PREFUNCTION]
Technical Sessions
SUNDAY, October 6th 11:00-12:30
Research & Application Session #1: Health
Session Chair: TBD
Learning Personalized Treatment Rules from Electronic Health Records Using Topic Modeling Feature Extraction
Peng Wu, Tianchen Xu and Yuanjia Wang
Chest Tube Management after Lung Resection Surgery using a Classifier
William Klement, Sebastien Gilbert, Donna E Maziak, Andrew J E Seely, Farid M Shamji, Sudhir R Sundaresan, Patrick J Villeneuve and Nathalie Japkowicz
Towards Automated Breast Mass Classification using Deep Learning Framework
Pinaki Sarkar, Priya Prabhakar, Deepak Mishra and Gorthi Subrahmanyam
Research & Application Session #2: Mobility and Smart Cities
Session Chair: Hillol Kargupta
Analyzing Driving Data using the ADAPT Distributed Analytics Platform for Connected Vehicles
Hillol Kargupta and Priyanka Kargupta
Lightweight Traffic Flow Prediction based on Gated Recurrent Unit with small dataset
Hongze Wang and Jing Liu — No SHOW
Higher Order Mining for Monitoring District Heating Substations
Shahrooz Abghari, Veselka Boeva, Jens Brage, Christian Johansson, Håkan Grahn and Niklas Lavesson
Research & Application Session #3: Time 1
Session Chair: TBD
Comparison of Variable Selection Methods for Forecasting from Short Time Series
Monnie McGee and Robert Yaffee
Variable-lag Granger Causality for Time Series Analysis
Chainarong Amornbunchornvej, Elena Zheleva and Tanya Berger-Wolf
Matrix Profile XVI: Efficient and Effective Labeling of Massive Time Series Archives
Frank Madrid, Shailendra Singh, Quentin Chesnais, Kerry Mauck and Eamonn Keogh
Research & Application Session #18: Machine Learning
Session Chair: Akram Farhadi
A Rademacher Complexity Based Method for Controlling Power and Confidence Level in Adaptive Statistical Analysis
Lorenzo De Stefani and Eli Upfal
Constrained Multi-Objective Optimization for Automated Machine Learning
Steven Gardner, Oleg Golovidov, Joshua Griffin, Patrick Koch, Wayne Thompson, Brett Wujek and Yan Xu
Bighead: A Framework-Agnostic, End-to-End Machine Learning Platform
Eli Brumbaugh, Mani Bhushan, Andrew Cheong, Michelle Gu-Qian Du, Jeff Feng, Nick Handel, Andrew Hoh, Jack Hone, Brad Hunter, Atul Kale, Alfredo Luque, Bahador Nooraei, John Park, Krishna Puttaswamy, Kyle Schiller, Evgeny Shapiro, Conglei Shi, Aaron Siegel, Nikhil Simha, Marie Sbrocca, Shi-Jing Yao, Patrick Yoon, Varant Zanoyan, Xiao-Han T. Zeng and Qiang Zhu
SUNDAY, October 6th 02:00-04:00
Research & Application Session #4: Text and Language
Session Chair: Incheon Paik
Detecting Sensitive Content in Spoken Language
Rahul Tripathi, Balaji Dhamodharaswamy, Srinivasan Jagannathan and Abhishek Nandi
Text Summarization by Highlighting Core Content
Zhixin Li, Zhi Peng, Suqin Tang and Canlong Zhang — No SHOW
Cross-Media Image Text Retrieval Combined with Global Similarity and Local Similarity
Zhixin Li, Feng Ling and Canlong Zhang
Paths to empathy: heterogeneous effects of reading personal stories online
Mahnaz Roshanaei, Christopher Tran, Sylvia Morelli, Cornelia Caragea and Elena Zheleva
Research & Application Session #5: Marketing and Demand Forecasting
Session Chair: P. Krishna Reddy
Uplift Modeling for Multiple Treatments with Cost Optimization
Zhenyu Zhao and Totte Harinen
Maximum Relevance and Minimum Redundancy Feature Selection Methods for a Marketing Machine Learning Platform
Zhenyu Zhao, Radhika Anand and Mallory Wang
Topology-based Clusterwise Regression for User Segmentation and Demand Forecasting
Rodrigo Rivera-Castro, Aleksandr Pletnev, Polina Pilyugina, Grecia Diaz, Ivan Nazarov, Wanyi Zhu and Evgeny Burnaev
On Analysing Supply and Demand in Labor Markets: Framework, Model and System
Hendrik Santoso Sugiarto, Ee-Peng Lim and Ngak-Leng Sim
MONDAY, October 7th 09:00-10:30
Research & Application Session #6: Manufacturing
Session Chair: Zhen Gao
Data-Driven Fault Diagnosis in End-of-Line Testing of Complex Products
Vitali Hirsch, Peter Reimann and Bernhard Mitschang
A real-time iterative machine learning approach for temperature profile prediction in additive manufacturing processes
Arindam Paul, Mojtaba Mozaffar, Zijiang Yang, Wei-keng Liao, Alok Choudhary, Jian Cao and Ankit Agrawal
Research & Application Session #7: Graphs and Social
Session Chair: Lorenzo De Stefani
Dynamic graph embedding via LSTM history tracking
Shima Khoshraftar, Sedigheh Mahdavi, Aijun An, Yonggang Hu and Junfeng Liu
SliceNDice: Mining Suspicious Multi-attribute Entity Groups with Multi-view Graphs
Hamed Nilforoshan and Neil Shah
FARE: Schema-Agnostic Anomaly Detection in Social Event Logs
Neil Shah
MONDAY, October 7th 11:00-12:30
Research & Application Session #8: Images
Session Chair: TBD
Deep Crowd Counting In Congested Scenes Through Refine Modules
Tong Li, Chuan Wang and Xiaochun Cao
Lightweight and Scalable Particle Tracking and Motion Clustering of 3D Cell Trajectories
Mojtaba Sedigh Fazli, Rachel V. Stadler, BahaaEddin Alaila, Stephen A. Vella, Silvia N. J. Moreno, Gary E. Ward and Shannon Quinn
Martensite Start Temperature Predictor for Steels Using Ensemble Data Mining
Ankit Agrawal, Abhinav Saboo, Wei Xiong, Greg Olson and Alok Choudhary
Research & Application Session #9: Time 2
Session Chair: Mahnaz Roshanaei
Biased Resampling Strategies for Imbalanced Spatio-Temporal Forecasting
Mariana Oliveira, Nuno Moniz, Luis Torgo and Vítor Santos Costa
Colorwall: An Embedded Temporal Display of Bibliographic Data
Jing Ming and Li Zhang
Hierarchical LSTM Framework for Long-Term Sea Surface Temperature Forecasting
Xi Liu, Tyler Wilson, Pang-Ning Tan and Lifeng Luo
MONDAY, October 7th 02:00-03:30
Research & Application Session #10: Classification
Session Chair: Wayne Thompson
On the classification consistency of high-dimensional sparse neural network
Kaixu Yang and Taps Maiti
Residual Networks Behave Like Boosting Algorithms
Chapman Siu
A Novel Multiple Classifier Generation and Combination Framework Based on Fuzzy Clustering and Individualized Ensemble Construction
Zhen Gao, Maryam Zand and Jianhua Ruan
Research & Application Session #11: Subgroups
Session Chair: Neil Shah
SeqScout: Using a Bandit Model to Discover Interesting Subgroups in Labeled Sequences
Romain Mathonat, Diana Nurbakova, Jean-Francois Boulicaut and Mehdi Kaytoue
FSSD – A Fast and Efficient Algorithm for Subgroup Set Discovery
Adnene Belfodil, Aimene Belfodil, Anes Bendimerad, Philippe Lamarre, Celine Robardet, Mehdi Kaytoue and Marc Plantevit
Research & Application Session #12: Clustering and Low-Rank Models
Session Chair: Julia Warnke-Sommer
A Rapid Prototyping Approach for High Performance Density-Based Clustering
Saiyedul Islam, Sundar Balasubramaniam, Poonam Goyal, Ankit Sultana, Lakshit Bhutani, Saurabh Raje and Navneet Goyal
Short Text Embedding for Clustering based on Word and Topic Semantic Information
Ziheng Chen and Jiangtao Ren
Shape Constrained Tensor Decompositions
Bethany Lusch, Eric C. Chi and J. Nathan Kutz
TUESDAY, October 8th 10:30-12:00
Research & Application Session #13: Regression
Session Chair: Bethany Lusch
Explaining the Performance of Black Box Regression Models
Ines Areosa and Luis Torgo
Customized Interpretable Conformal Regressors
Ulf Johansson, Cecilia Sönströd, Tuwe Löfström and Henrik Boström
A Study on the Impact of Data Characteristics in Imbalanced Regression Tasks
Paula Branco and Luis Torgo
Research & Application Session #14: Recommender Systems
Session Chair: Mohit Sharma
Grade Prediction with Neural Collaborative Filtering
Zhiyun Ren, Xia Ning, Andrew Lan and Huzefa Rangwala
MARS: Memory Attention-Aware Recommender System
Lei Zheng, Chun-Ta Lu, Lifang He, Sihong Xie, Huang He, Chaozhuo Li, Vahid Noroozi, Bowen Dong and Philip Yu
VizCertify: A framework for secure visual data exploration
Lorenzo De Stefani, Leonhard F. Spiegelberg, Eli Upfal, and Tim Kraska
TUESDAY, October 8th 1:30-03:00
Research & Application Session #15: Relation Extraction and Record Linkage
Session Chair: Edward Porter
Attention-Based Network with Prior Knowledge for Domain-Specific Relation Extraction
Liqi Kang, Chong Feng, Ge Shi, Jun Ma, Yang Wang and Heyan Huang — No SHOW
A Novel Record Linkage Interface That Incorporates Group Structure to Rapidly Collect Richer Labels
Kayla Frisoli, Benjamin LeRoy and Rebecca Nugent
Machine Learning for Efficient Integration of Record Systems for Missing US Service Members
Julia Warnke-Sommer and Franklin Damann
TUESDAY, October 8th 3:30-04:30
Research & Application Session #16: Kernel Methods
Session Chair: Efstathia Bura
Semantically-aware Statistical Metrics via Weighting Kernels
Stefano Cresci, Roberto Di Pietro and Maurizio Tesconi
Joint Selection of Central and Extremal Prototypes Based on Kernel Minimum Enclosing Balls
Christian Bauckhage and Rafet Sifa
Research & Application Session #17: Streams
Session Chair: TBD
Unsupervised Drift Detector Ensembles for Data Stream Mining
Lukasz Korycki and Bartosz Krawczyk
An Incremental Technique for Mining Coverage Patterns in Large Databases
Akhil Ralla, P. Krishna Reddy and Anirban Mondal
Special Sessions
MONDAY, October 7th, 9:00 – 12:30
Keynote: “Towards High-Quality Big Data for Responsible Data Science”
Divesh Srivastava
Range Analysis and Applications to Root Causing
Zurab Khasidashvili and Adam J Norman
Sensor-Based Human Activity Mining Using Dirichlet Process Mixtures of Directional Statistics Models
Lei Fang, Juan Ye and Simon Dobson
Truth Discovery from Multi-Sourced Text Data Based on Ant Colony Optimization
Chen Chang, Jianjun Cao, Guojun Lv and Nianfeng Weng
TUESDAY, October 8th, 10:30 – 04:30
Breast Cancer Classification using Deep Transfer Learning on Structured Healthcare Data
Akram Farhadi, David Chen, Rozalina McCoy, Christopher Scott, John Miller, Celine Vachon and Che Ngufor
Feature Enhanced Fully Convolutional Networks for Monocular Depth Estimation
Chunxiu Shi, Jie Chen, Juan Chen and Zheng Zhang
Least Squares and Maximum Likelihood Estimation of Sufficient Reductions in Regressions with Matrix Valued Predictors
Ruth Pfeiffer, Wei Wang and Efstathia Bura
Keynote: “Neurotechnologies and machine learning: from neurosurgery and neurology to neuroeducation and cybersport”
Evgeny Burnaev, Alexander Bernstein, Maxim Sharaev
TBOVW : A Temporal Bag of Visual Word Model based Molecular Subtypes Recognition of Breast Cancer in DCE-MRI with Heterogeneity
Wei Li, Chaolu Feng, Dazhe Zhao, Yiqing Wang, Dongjie Wang and Xin Min — No SHOW
Sign Language Recognition Analysis using Multimodal Data
Al Amin Hosain, Panneer Selvam Santhalingam, Parth Pathak, Jana Kosecka and Huzefa Rangwala