I am

Yizhou Yan

A full-time Ph.D Student @ Worcester Polytechnic Institute
Involving with big data analytics and mining.

About Me

  • Name: Yizhou Yan
  • Date of birth: 2 Mar 1991
  • Address: FL 319, 100 Institute Road,
    Worcester, MA
  • Nationality: Chinese
  • Email: yyan2@wpi.edu

Short Bio

I am working under the supervision of Professor Elke A. Rundensteiner. I'm interested in big data analytics and mining, especially in areas of outlier detection. Recently, I mainly focus on outlier detection in IoT data.
I am looking for summer internship of 2018.

Education

  • 2015-Now

    Ph.D Candidate

    Computer Science, Worcester Polytechnic Insitute,MA, USA

    Focus on outlier detection.

  • 2013-2015

    Master of Engineering (M.E.)

    Software Engineering, Dalian University of Technology, China

    Focused on data mining in scholar data and bioinformatics.

  • 2009-2013

    Bachelor of Engineering (B.E.)

    Software Engineering, Dalian University of Technology, China

    Bachelor of Arts (B.A.)

    Japanese, Dalian University of Technology, China

Research

  • 2017.02-Now

    TOP

    Typical and Outlier Pattern Mining in IoT Sequence Data (Collaborating with MIT CSAIL and Philips Research)
    In submission

    In this paper, we present a new system called TOP to make sense of sequences by finding Typical and Outlier Patterns in IoT sequence data. Specifically, TOP offers a new frequent pattern semantics called Contextual Bi-frequent patterns (CBF patterns). Moreover, we develop customized algorithms for mining CBF patterns and outliers that violate CBF patterns.

  • 2016.06-2017.02

    TOLF

    Scalable Top-n Local Outlier Detection
    Accepted by KDD 2017 VIDEO

    In this work, we present the first scalable Top-N local outlier detection approach called TOLF. The key innovation of TOLF is a multi-granularity pruning strategy that quickly prunes most points from the set of potential outlier candidates. Our customized density-aware indexing structure not only effectively supports the pruning strategy, but also accelerates the kNN search.

  • 2015.11-2016.06

    DLOF

    Distributed Local Outlier Detection in Big Data
    Accepted by KDD 2017 VIDEO

    In this work, we present the first distributed solution for the Local Outlier Factor (LOF) method, namely DLOF, which is scalable to terabyte level data.

  • 2015.09-2015.11

    DOD

    Multi-Tactic Distance-Based Outlier Detection
    Accepted by ICDE 2017

    In this work, we present the first distributed distance-based outlier detection approach using the MapReduce-based infrastructure, called DOD. Our experimental study confirms the efficiency of DOD and its scalability to terabytes of data, beating the baseline solutions by a factor of 20x.

Publication

  1. Yizhou Yan, Lei Cao, Elke Rundensteiner.
    Scalable Top-n Local Outlier Detection. KDD 2017.

    VIDEO
  2. Yizhou Yan, Lei Cao, Caitlin Kuhlman, Elke Rundensteiner.
    Distributed Local Outlier Detection in Big Data. KDD 2017.

    VIDEO
  3. Lei Cao, Yizhou Yan, Caitlin Kuhlman, Qingyang Wang, Elke A. Rundensteiner, Mohamed Eltabakh.
    Multi-Tactic Distance-Based Outlier Detection. 2017 IEEE 33rd International Conference on Data Engineering (ICDE). IEEE, 2017.
  4. Caitlin Kuhlman, Yizhou Yan, Lei Cao, Elke Rundensteiner.
    Pivot-based Distributed K-Nearest Neighbor Mining. ECML-PKDD 2017.
  5. Yu Liu, Zhen Huang, Yizhou Yan, Yufeng Chen.
    Science Navigation Map: an Interactive Data Mining Tool for Literature Analysis. WWW’ 15 Companion, Florence, Italy May 18-22,2015.
  6. Yu Liu, Zhen Huang, Jing Fang, Yizhou Yan.
    An Article Level Metric in the Context of Research Community. WWW’ 14 Companion, Seoul, Korea, April 7-11, 2014.
  7. Zhewen Shi, Yu Liu, Yizhou Yan, Xiaowei Zhao.
    A Hierarchical Community Detection Method in Complex Networks. Journal of Computational Information Systems, vol.9, no.24, pp. 9715-9724, 2013.

Skills

Java

90%

C/C++

60%

Python

65%

Matlab

65%

Data Mining

75%

Database

75%

Distribute Sys

65%

English

80%

Japanese

80%

Chinese

100%

More skills

leadership
Creativity
Management
Branding
Marketing
Motivation

Get in touch

Address/Street 100 Institute Road,
Worcester, USA
Phone Number 508-414-8049