Skip to Main Content



Michelle's Homepage

Basic information about me

This page lists my workshops with links to the newest version of materials. Upcoming fall/spring workshops that I teach can be found on this event page.

Unix Command Line Fundamentals

Why Unix Command Line?

The Unix command line is a powerful tool that allows you to control operation systems (OSs) directly with text commands, which allows more flexibility and efficiency beyond what the Graphic User Interfaces (GUIs) offer. In addition, complex calculations and data analyses are often run on High-performance computing (HPC) clusters; and HPC clusters are often operated in Unix systems. 

What's in this workshop?

This workshop explains the most basics of Unix commands and you will learn to navigate through folders and view files. No prior knowledge is required.

Git Basics for Absolute Beginners

Why Git?

Git is a version control system: it tracks and manages changes to your files over time and allows you to include explanations for each change. This enables you to investigate changes and correct problems without losing code or data. Git works best with plain text data (such as txt, csv, tsv files) and most code files.

What's in this workshop?

This workshop introduces you to Git–how it works, its benefits, and how to get started—by learning and practicing the basics. This workshop is for anyone that works with plain text data and code but requires no prior knowledge of Git.

Basic Data Cleaning with OpenRefine

Why OpenRefine?

In preparation for data analysis or training a machine learning model, do you ever find yourself spending endless hours fixing typos and inconsistent formats in your dataset? Are you frustrated with repeating the same cleaning steps on multiple similar datasets? Faster than Excel, easier than Python, OpenRefine is a free and powerful software to explore, transform, and clean your data and document your cleaning process so you can apply the same steps to similar data.

What's in this workshop?

This workshop will showcase the basics of OpenRefine through through explanation and hands-on exercise. Bring your own messy data to tailor to your needs after a guided introduction. This is a beginner level workshop, and no prior experience is needed.

Bullet Journal as Research Planner

Why bullet journal?

Do you feel overwhelmed by the tasks and deadlines in your research project? Do you wish for a more flexible research planner? If so, bullet journaling could be the answer for you. Bullet journaling is a popular organization and productivity system that can be customized to fit your specific needs.

What's in this workshop?

This workshop will teach you the basics of setting up your own research planner. You will learn and practice rapid log notation, which allows you to organize information and tasks in a concise and structured manner. No prior experience or fancy supplies required - all you need is a pen.

One-hour Data Management

Why data management?

An ounce of prevention is worth a pound of cure, especially when it comes to your valuable files. We've all heard of the horror stories of someone losing their important files or the panic of data leaks. There are 5 key practices that can mitigate these risks.

What's in this workshop?

In the first 30 minutes of this workshop, we'll review 5 data management practices: backups, naming & organization, documentation, security, and future proofing. The second half is optional time for you to stay around for discussion and questions. This workshop is for anyone who wants to improve their data management. Feel free to bring your own files to get help on.

This brief workshop is based on the more extensive "Quick & Dirty Data Management" workshop materials provided by MIT. 

Other Research Data Workshops

Besides mine, the University Library also provides other research data themed workshops, for example:

  • Introduction to DataShare and the ISU Digital Repository
    • This workshop covers how ISU's open-access repositories--DataShare and the Digital Repository--can help ISU faculty, students, and staff share and preserve your paper and data publications. You'll learn how to add publications to these systems and about benefits like tracking statistics, stable links, and author profiles.
  • Unpacking (Cy)Box for Better File Management
    • This workshop teaches you to organize files for easy retrieval and sharing. Discover how to manage files effectively, access them from anywhere, and collaborate with others. Leave the workshop armed with file management strategies, including naming conventions, folder structures, and documentation techniques.
  • Escape from Spreadsheet Hell
    • This workshop introduces ways to structure your research data so it's both computer and human friendly. You can’t escape spreadsheets, but you can escape spreadsheet hell! This beginner-level Zoom workshop covers how to organize and document data in a spreadsheet program such as Excel. Includes some hands-on practice.

Check out library event page for upcoming workshops.

 

Profile Photo
Michelle Zhai
Contact:
The Catalyst / 204 Parks Library
(515) 294-3669