Look inside the new edition

The first edition (2001) was the go-to handbook for Perl developers who needed to coerce scruffy, inconsistent data into tidy, predictable formats. Twenty-four years on, data is bigger, messier and more multilingual – and Perl has evolved to match.

This edition adds:

  • 12 fully updated chapters
  • Perl 5.40 syntax throughout
  • Modern CPAN modules
  • New data formats
  • Best practices
Purchase on LeanPub

Data Parsing

Understand how to effectively parse complex data structures for easier manipulation and analysis.

Data Cleaning

Learn techniques to efficiently clean and preprocess messy data for accurate analysis and reporting.

Data Transformation

Master the art of transforming data formats to suit various applications and integration needs.

"A non-academic, on-the-money, practical book on converting, filtering and parsing data"

Full Chapter Overview

The second edition of Data Munging with Perl is structured around a practical, format-driven approach to working with messy data. Each chapter focuses on a different kind of structure or technique — from simple lines of text to nested formats and binary records — with real-world examples throughout.

Chapter 1: Data, data munging and Perl

iWhy Perl is still an excellent tool for data wrangling.

Chapter 3: Useful Perl idioms

Modern Perl syntax and idioms every munging script needs.

Chapter 5: Unstructured data

Techniques for parsing free-form or chaotic input.

Chapter 7: Fixed-width and binary data

Reading and unpacking data where position matters.

Chapter 9: HTML

Extracting meaningful data from markup with modern tools.

Chapter 11: Building your own parsers

Regex, grammars and Regexp::Grammars.

Chapter 2: General munging practices

Core principles for approaching any munging task.

Chapter 4: Pattern matching

Using regex and Perl’s pattern tools to extract structure.

Chapter 6: Record-oriented data

Working with delimited, structured text formats like CSV.

Chapter 8: Complex data formats

Introducing parsers for complex data.

Chapter 10: Hierarchical data

Parsing and generating XML, YAML and JSON effectively.

Chapter 12: Looking back—and ahead

What you've learned and where to learn more.

What people are saying

Reviews of the first edition

Testimonial author

John M. Ford

Amazon

Cross has written a well-designed book with helpful examples and insights. The accompanying book web site and author web site provide downloadable code and other resources. This book is of course most useful to those working in Perl. But many general concepts and strategies have transferred well to data munging tasks I have done in TextPipe.

Testimonial author

Thing with a hook

Amazon

You're shown all the handy tricks you need such as piping, taking input from standard in as well as files, slurping paragraphs etc. My 4-star rating applies if this sounds like what you need: it's a clear, short and to-the-point book, which is definitely taking with you on your first journey into data munging.

Testimonial author

Marc Perry

Goodreads

Day-to-day, in the trenches, this is the book that has influenced my own coding more than any other. Lucid and practical; every page brims with proven advice..

Testimonial author

Plato

PerlMonks

After a bit of research I bought this book last week, I'm just sorry that I didn't buy it months ago. I like the style that it's written in, and it's chock-a-block full of useful information. I've been writing Perl (hacking at it really) for about six months now, and this is just the sort of stuff that I need to improve my skills.

Read it on your favorite device!

Lorem ipsum dolor sit amet, consectetur adipisicing elit. Molestiae quod, nobis repudiandae! In commodi accusamus molestiae, sunt, tempora fugiat doloribus sint quisquam illo numquam corporis tenetur necessitatibus expedita eius, sed.

Take a sneak peak insight

Sed laborum omnis earum facere culpa optio natus quaerat minus.

Reporting & Analysis

Sed laborum omnis earum facere culpa optio natus quaerat minus.

Purchase on LeanPub

Yasin Arafat

- A technical Writer

Do you want to be even more successful? Learn to love learn and growth. The more efort you put into improving your skills, the bigger the payoff you will get.

It is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout.

We are award winner best selling book among the top rated 10 books!

International award
Meritorious award
Best book award
Excellence award

Others books by Dave Cross

Laboriosam eos aperiam excepturi accusamus velit repudiandae sit rem incidunt, similique facere quas blanditiis.

Our insights and articles

Laboriosam eos aperiam excepturi accusamus velit repudiandae sit rem incidunt, similique facere quas blanditiis.

Contact Us

Whether you have questions or you would just like to say hello, contact us.Lorem ipsum dolor sit amet, consectetur adipisicing elit. Provident, atque!

Contact Quickly

+23-68017684

Email

startor@support.com

Location

397 Lake forest drive street USA