Josslyn Zhang Aspiring Data Engineer

A Few Words About Myself

I'm Josslyn Zhang, a recent graduate in Statistics, Computer Science and Mathematics. I'm currently based in Denver, Colorado, and I hope to work as a Data Engineer. My curiosity about search and recommendation engines drove me to work with data in academia. In my spare time I've recently been experimenting with mathematical modeling and machine learning in Python, using tools such as TensorFlow. I also enjoy volunteer work, travel and street food, and I hope to learn surfing and diving in the future.

Design

Statistical modeling tools that I enjoy: Apache Spark and Hadoop MapReduce. RDBMS design: MySQL Workbench.

Code

Programming languages, most familiar to least: Python, Java, C. Mathematical programming, most familiar to least: R, SAS, MATLAB.

Tools

Familiar with JetBrains tools, especially IntelliJ IDEA, PyCharm and CLion. I also use Atom a lot for code editing.

Featured Projects

Tongue Stimulation for Neural Communication of Audio Information

  • Programming: SAS, R
  • Statistical Testing: Likelihood Tests, F-tests
  • Mathematical Skills: Linear Mixed Modeling, Data distribution

Estimating PageRank Values of Wikipedia Articles with Apache Spark

  • Algorithms: Idealized and taxation-based PageRank algorithms
  • Analytics Tools: Hadoop MapReduce, Apache Spark and HDFS

Document Summarization using TF/IDF Scores

  • Built an authorship identification system based on similarity analysis
  • Calculated TF/IDF scores and summarized documents using Hadoop MapReduce

Designing and Implementing a New Application Using Raspberry Pi

  • Created a socket program using Python
  • Add-on sensor board: Sense HAT