Data wrangling, sometimes referred to as data munging, is the process of transforming and
mapping data from a raw form into another format with the intent of making it useful
for a variety of purposes such as analytics.
Raw forms that need to be wrangled include text, HTML, XML,
mainframe data files, PDF, Word documents, Excel XLS, fixed-length or delimited files (CSV, Tab, Pipe, etc.),
log files, and structured and unstructured reports of any size or dimension.
TextPipe will help you wrangle:
HTML, XML and other structured documents from the WWW
Fixed-length or delimited files (CSV, Tab, Pipe, etc.)
Files inside Zip archives, and Microsoft Office 2007/2010 Word (DOCX),
Excel (XLSX), PowerPoint (PPTX), OpenOffice Writer (ODT), Calc (ODS)
and Impress (ODP) formats
Simple EBCDIC files with no structure. These can be converted using the Mainframe \ EBCDIC to ASCII filter.
Fixed length mainframe files. Use the Mainframe \ EBCDIC to ASCII filter followed by the Convert \ End of Line Characters filter.
For 132-column mainframe reports, set the fixed length to 133.
Mainframe CMS Format (variable line length files). Use the Convert \ End of Line Characters filter.
Fixed record size with a copybook (e.g. with packed and zoned decimals). Use the Mainframe Copybook filter and simply paste in your copybook.
Multi-record formats
Structured and unstructured reports of any size or dimension
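To make the mainframe items above more concrete, here is a minimal Python sketch of the two underlying ideas: converting fixed-length EBCDIC records to text lines, and decoding a packed decimal (COMP-3) field of the kind a copybook describes. This is not TextPipe's implementation. The function names `ebcdic_records_to_text` and `unpack_comp3` are hypothetical, and the code assumes the data uses the common cp037 EBCDIC code page and the usual packed decimal sign nibbles (0xD or 0xB for negative). Your files may differ.

```python
from decimal import Decimal


def ebcdic_records_to_text(data: bytes, record_length: int,
                           codepage: str = "cp037") -> str:
    """Split fixed-length EBCDIC data into records, one text line per record.

    Assumption: the file is plain EBCDIC text in the given code page,
    with no packed or binary fields inside the records.
    """
    if record_length <= 0:
        raise ValueError("record_length must be positive")
    lines = []
    for start in range(0, len(data), record_length):
        record = data[start:start + record_length]
        # Decode to text and drop the padding at the end of the record.
        lines.append(record.decode(codepage).rstrip())
    return "\n".join(lines)


def unpack_comp3(field: bytes, scale: int = 0) -> Decimal:
    """Decode a packed decimal (COMP-3) field.

    Each byte holds two decimal digits, except the last byte, which holds
    one digit and a sign nibble. `scale` is the number of implied decimal
    places (for example 2 for a PIC S9(3)V99 field).
    """
    if not field:
        raise ValueError("field must not be empty")
    digits = []
    for byte in field[:-1]:
        digits.extend((byte >> 4, byte & 0x0F))
    digits.append(field[-1] >> 4)
    if any(d > 9 for d in digits):
        raise ValueError("invalid digit nibble in packed decimal field")
    sign_nibble = field[-1] & 0x0F
    value = int("".join(str(d) for d in digits))
    if sign_nibble in (0x0B, 0x0D):  # assumed negative sign nibbles
        value = -value
    # Shift the decimal point left by `scale` places.
    return Decimal(value).scaleb(-scale)
```

For example, `ebcdic_records_to_text(raw, 133)` would split a 132-column report stored with a fixed length of 133 into lines, and `unpack_comp3(bytes([0x12, 0x34, 0x5C]), scale=2)` decodes the three bytes as 123.45.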
Wrangle data today
Automation
Once tested, the TextPipe filter can be run automatically or on a schedule from the command line. It can also be controlled by other software using COM.