Ran Liu

PhD of Computer Science

University of Maryland, Baltimore County

Biography

My strongest research area is in machine learning, with a specific focus on classification and detection problems. This includes working on attack detection and the development of advanced techniques like generative adversarial networks (GANs) to efficiently generate adversarial examples for the adversarial training. My work primarily uses feature engineering and data quality improvement techniques.

Contact me at: rliu2{at}umbc{dot}com.

Download my resumé .

Interests

Machine Learning
Cybersecurity
Time Series Model
Back-End Development

Education

PhD in Computer Science, 2024
University of Maryland, Baltimore County
MS in Computer Science
University of California, Santa Barbara
MS in Information Security
Johns Hopkins University

Recent News

[Nov 11] Our paper " Evaluating Representativeness in PDF Malware Datasets: A Comparative Study and a New Dataset" has been accepted for publication at CyberHunt 2023 (IEEE bigdata 2023)

[Jul 12] Our paper “A PDF Malware Detection Method Using Extremely Small Training Sample Size” has been accepted for publication in DocEng ‘23 conference.

[June 28] Our paper “A Feature Set of Small Size for the PDF Malware Detection” has been accepted by KDD'23 Workshop on Knowledge-infused Learning.

[May 28] Our poster “A Distance-Based Method for Detecting PDF Malware” has been accepted by METM 2023

Last updated on Nov 12, 2023 1 min read

Experience

Machine Learning Research Intern

SAS Institute Inc. IDeaS Division

Jan 2023 – Aug 2023 Minneapolis, MN

Responsibilities include:

Designed machine learning predictive model using PyTorch for competitors identification through Hidden Markov Model, Convolutional Neural Network and Dynamic Time Warping, improving detection accuracy from 46% to 67%.
Developed RESTful APIs for the production system using Java Spring Framework, and implemented a lightweight version algorithms using MySQL, benefiting over 30,000 clients.
Designed statistical model to predict errors in forecasting system through Random Forest and ARIMA, decreasing the mean absolute percentage error (MAPE) from 0.17 to 0.04, contributing to over $12 million revenue.
Designed clustering algorithms utilizing GNN and GAN with TensorFlow. Refactored the base code using Java functional programming, reducing running time from approximately 8 days to around 3 hours.

Software Engineer Intern

Amazon

May 2022 – Aug 2022 Seattle, WA

Responsibilities include:

Designed and implemented new I/O APIs optimizations using C++ that were used by all teams in the AWS Aurora division, reducing the IOPS (Input/Output Operations Per Second) of the AWS Aurora over 80%.
Designed a new batch data structure for I/O submission, reducing system calls by a factor of 4,096.
Refactored code to eliminate lock dependencies using multithreading techniques, reducing I/O latency by 77%.

Application Security Analyst

Shift4 Payment

Feb 2018 – Aug 2020 Las Vegas, NV

Responsibilities include:

Led 12 penetration tests on web and mobile applications using Burp Suite, NMAP, Windows shell, and Python. Delivered comprehensive risk assessments and recommended solutions to managers and stakeholders.
Led code review processes before application releases, offering in-depth risk assessments and proposing technical solutions to enhance security and programming standards.
Developed statistical models using Hidden Markov Model and Exponential Smoothing to detect transaction fraud.

Projects

Innovative System for Identifying Competitors

Designed machine learning predictive model using PyTorch for competitors identification through Hidden Markov Model, Convolutional Neural Network and Dynamic Time Warping, improving detection accuracy from 42% to 67%.
Developed RESTful APIs for the production system using Java Spring Framework, and implemented a lightweight version algorithms using MySQL, benefiting over 30,000 clients.

Forecasting Errors in Time Series Model

Designed statistical model to predict errors in forecasting system through Random Forest and ARIMA, decreasing the mean absolute percentage error (MAPE) from 0.17 to 0.04, contributing to over $12 million revenue.

The Design of high-performance I/O kernel interface

Developed C++ I/O APIs for AWS Aurora, achieving an 80% reduction in IOPS across all teams.
Created a batch data structure for I/O, cutting system calls by 4,096 times.
Enhanced code using multithreading to remove lock dependencies, lowering I/O latency by 77%.

[Read More]

A Distributed File System with High Scalability and Fault Tolerance

A fault-tolerant distributed file system that supports concurrent read/write operations using AWS, Java RMI, and the Primary protocol. The system has good scalability which can handle 10,000 requests with a maximum of 10,000 requests running concurrently.

[Read More]

A Distributed File System with High Scalability and Fault Tolerance

Recent Publications

Quickly discover relevant content by filtering publications.

Ran Liu, Robert Joyce, Cynthia Matuszek, Charles Nicholas (2023). Evaluating Representativeness in PDF Malware Datasets: A Comparative Study and a New Dataset. IEEE BigData ‘23: Proceedings of CyberHunt workshop of the IEEE BigData Conference.

Ran Liu, Charles Nicholas (2023). A Feature Set of Small Size for the PDF Malware Detection. KDD ‘23: Proceedings of the KDD Workshop on Knowledge-infused Learning.

DOI

Ran Liu, Matuszek Cynthia, Charles Nicholas (2023). A PDF Malware Detection Method Using Extremely Small Training Sample Size. DocEng ‘23: Proceedings of the ACM Symposium on Document Engineering 2023.

DOI

Ran Liu, Maksim Eren, Charles Nicholas (2022). Can Feature Engineering Help Quantum Machine Learning for Malware Detection. MTEM 2022.

Ran Liu, Charles Nicholas (2021). Incremental Malware Detection and Classification Using Hidden Markov Models. ICCWS.

Ran Liu, Charles Nicholas (2021). Incremental Malware Classification Using Hidden Markov Models. MTEM 2021.

Shin, Chanyang, Prerit Chandok, Ran Liu, Seth James Nielson and Timothy R. Leschke (2017). Potential Forensic Analysis of IoT Data: An Overview of the State-of-the-Art and Future Possibilities. 2017 IEEE International Conference on Internet of Things (iThings) (2017): 705-710..

Contact

rliu2{at}umbc{dot}com