Thread: Parsing PDF files with vim?

  1. #1

    Parsing PDF files with vim?

    Alright, here's the deal. A friend of mine works for a microchip corporation. They write their manuals in MS Word, and each manual contains hidden text. She's supposed to make a PDF out of each of these Word files and then make absolutely certain that none of the hidden text appears in the PDF (this includes editing the actual PDF with an editor, running a Perl script on it, etc.). To test this, she put a hidden sentence into a dummy Word document and created a PDF from it, so she can check whether that sentence shows up. She's done all she can to make sure the hidden sentence doesn't appear, but she'd like to be certain, so she's asked me and a few other friends to see if we can find it within the PDF.
    My question: does anyone know of a tool for Linux that can do just that, or that can be used to do that (i.e. parse a PDF file to find any traces of the hidden sentence)? I believe vim might be useful for this, but I'm not sure. Alternatively, would Perl be capable of this? If so, could somebody point me in the right direction for writing that Perl script? Thanks for any help you can offer.

  2. #2

    Re: Parsing PDF files with vim?

    How about something like:

    strings myfile.pdf | grep "This is a hidden sentence"

    strings extracts the printable ASCII strings from a file, and grep searches them for a particular pattern. Is that the kind of thing you're after?
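
One caveat with the strings approach: it only sees the bytes as they are stored in the file, and PDF content streams are very often compressed (typically with the Flate filter), so text inside them will not show up as readable ASCII. The following is a minimal sketch, not a definitive checker, that searches both the raw bytes and every stream that zlib can inflate. The names `candidate_blobs` and `find_phrase` are made up for illustration. It assumes an unencrypted PDF and only handles Flate-compressed streams; it will also miss text stored as hex strings or drawn with a remapped font encoding, so a negative result is a hint rather than proof.

```python
import re
import zlib

# Matches the body of each PDF stream object: the bytes between the
# "stream" keyword (followed by a line break) and the "endstream" keyword.
STREAM_RE = re.compile(rb"stream\r?\n(.*?)\r?\n?endstream", re.DOTALL)


def candidate_blobs(pdf_bytes):
    """Yield the raw file bytes, then every stream body that zlib can inflate."""
    yield pdf_bytes
    for match in STREAM_RE.finditer(pdf_bytes):
        try:
            # decompressobj tolerates trailing bytes after the compressed data
            yield zlib.decompressobj().decompress(match.group(1))
        except zlib.error:
            # Not Flate-compressed; the raw bytes were already yielded above
            continue


def find_phrase(pdf_bytes, phrase):
    """Return True if phrase occurs as UTF-8/ASCII or UTF-16BE in any blob."""
    needles = [phrase.encode("utf-8"), phrase.encode("utf-16-be")]
    return any(
        needle in blob
        for blob in candidate_blobs(pdf_bytes)
        for needle in needles
    )
```

To try it on a real file, read the file in binary mode and pass the contents in, for example `find_phrase(open("myfile.pdf", "rb").read(), "This is a hidden sentence")`. The UTF-16BE check is there because PDF metadata strings are sometimes stored in that encoding.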
