Eva Nguyen

This is where I showcase my data science and analytics projects

Comparing Machine Learning Models

Posted on January 19, 2020

Which of the three models is best for this data? Based on the evaluation metrics below, I would choose the k-nearest neighbors (K=3) method as the ‘best’ for this data. The reason is that KNN outperforms LDA and QDA on all metrics. The KNN has the highest... [Read More]
Tags: R programming, RStudio, KNN, LDA, QDA, statistical modeling, machine learning, supervised learning, confusion matrix, F1 score, logloss
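
For readers who want a feel for two of the metrics named in the tags, here is a minimal Python sketch of an F1 score and a binary log loss. It is not the R code from the post: the function names, the 0/1 label encoding, and the toy numbers in the usage note are all assumptions made up for illustration, and none of them are results from the post.

```python
import math


def f1_score(y_true, y_pred, positive=1):
    """F1 for one class: harmonic mean of precision and recall."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are both zero (or undefined).
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def log_loss(y_true, prob_positive, eps=1e-15):
    """Mean binary log loss; assumes labels are 0 or 1 (an assumption of this sketch)."""
    total = 0.0
    for t, p in zip(y_true, prob_positive):
        p = min(max(p, eps), 1 - eps)  # clip to avoid log(0)
        total -= t * math.log(p) + (1 - t) * math.log(1 - p)
    return total / len(y_true)
```

With made-up labels, `f1_score([1, 0, 1, 1], [1, 0, 0, 1])` gives precision 1 and recall 2/3, so an F1 of 0.8. A higher F1 and a lower log loss both indicate a better model, which is the kind of comparison the post describes.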

Data Manipulation with Natural Language Processing (NLP)

Posted on December 12, 2019

The project’s purpose is to clean and manipulate text data prior to applying machine learning techniques. The dataset contains errors to simulate data collection in the real world. [Read More]
Tags: data manipulation, python, NLTK, NLP, pandas, numpy, matplotlib

R Shiny App

Posted on December 10, 2019

The project’s purpose is to develop an R Shiny app for stakeholders, such as realtors, to use to make effective business decisions. [Read More]
Tags: R programming, RStudio, R Shiny, data visualization, dashboard

Eva Nguyen  •  2020  •  nguyeneva.github.io
