This webinar will discuss a CSIAC-developed prototype for detecting and extracting personally identifiable information (PII) from over a thousand binary file formats by leveraging the widely used open-source Apache Tika toolkit. The prototype, called “BFAS – Binary File Application Scanner,” integrates Tika through a custom PowerShell cmdlet that injects a text extraction facility into the existing standard PowerShell pipeline. A graphical user interface (GUI) was developed to facilitate multiprocessing and XML-based reporting and visualization. Ideas for extending the BFAS architecture with machine learning (ML) methods will also be discussed.
Register for CSIAC Webinar Thursday, May 30 @ 12:00 pm EDT: BFAS – Binary File Application Scanner: A Prototype for Scanning, Detecting and Reporting PII in Disparate Binary Formats