Text Data Processing

We live in an age of big data, and working with it depends on processing data efficiently. This section covers text data processing, including data organization, regular expressions, and web scraping.

Examples can be found in the timing folder.

Data Format

There are two basic packages for reading and writing ASCII data:

The first is a built-in package for relatively simple data, while the second handles more complicated data and offers seamless support for DataFrame.
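As a rough illustration of the simple case, the sketch below writes a small delimited table to a temporary file and reads it back. It assumes the built-in package is the DelimitedFiles standard library, which is a guess on my part; the sample values and the variable names (header, data, vals, cols) are made up for the example.

```julia
using DelimitedFiles  # standard library; assumed here to be the built-in option

# Hypothetical sample data: one header row and two numeric rows.
header = ["x" "y"]
data = [1.0 2.0; 3.0 4.0]

# Write the header and the data to a temporary comma-delimited file.
path = tempname()
open(path, "w") do io
    writedlm(io, header, ',')
    writedlm(io, data, ',')
end

# Read the file back. With header=true, readdlm returns a tuple of
# (data cells, header cells) instead of a single matrix.
vals, cols = readdlm(path, ',', Float64; header=true)

# Clean up the temporary file.
rm(path)
```

Here vals is a 2×2 matrix of Float64 and cols is a 1×2 matrix of column names. For data with mixed column types or missing values, a DataFrame-oriented reader is usually the more comfortable choice.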

Regular Expression

Julia has built-in support for regular expressions. The best reference and tool I have ever used for regular expressions in general is RegExr.
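A minimal sketch of the built-in regex support is shown below. The sample string and the date pattern are made up for illustration; only functions from Base are used (occursin, match, eachmatch, replace).

```julia
# Hypothetical sample text; the log-like format is an assumption.
text = "2021-03-04 ERROR disk full; 2021-03-05 INFO rebooted"

# A regex literal with three capture groups: year, month, day.
date_re = r"(\d{4})-(\d{2})-(\d{2})"

# occursin tests whether the pattern matches anywhere in the string.
has_date = occursin(date_re, text)

# match returns the first RegexMatch, or nothing when there is no match.
m = match(date_re, text)
first_date = m.match              # the matched substring
year = parse(Int, m.captures[1])  # first capture group as an integer

# eachmatch iterates over all non-overlapping matches.
all_dates = [String(x.match) for x in eachmatch(date_re, text)]

# replace with a substitution string; \1, \2, \3 refer to the capture groups.
swapped = replace(text, date_re => s"\3/\2/\1")
```

Because match returns nothing when the pattern is not found, real code should check the result before accessing m.match or m.captures.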

Web Scraping

Packages worth mentioning: