2015 AIChE Spring Meeting and 11th Global Congress on Process Safety

(204a) Process Learning Data Pipeline for Dealing with the Big Data Challenges

To uncover the valuable information from a large amount of process data stored in databases, a process learning data pipeline consisting of transformation, cleaning, re-sampling and exploratory data analysis (EDA) units is built using Python with a graphical user interface (GUI). The pipeline incorporates techniques and knowledge from various fields, such as statistics, computer science, and signal processing. A test case is provided based on industrial data set, and by analyzing the results coming from the pipeline, a better understanding of the process is obtained.