Java/Python developers

Hi all, I am looking for an automation project for my office work in which I want to read my Word (.doc/.docx) and PDF files and extract data from them to be stored in, for example, an Excel sheet.
If anyone can help with using ELK for the same problem, that would be wonderful.
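
To make the request more concrete, here is a rough Python sketch of the kind of pipeline described above. It is only an illustration under several assumptions: it uses the standard library alone, so it handles .docx only (old .doc files and PDFs would need a third-party library), it assumes the data of interest sits on lines shaped like "Label: value" (a hypothetical layout), and it writes a CSV file that Excel can open rather than a native .xlsx workbook. The function names `docx_paragraphs`, `extract_fields`, and `write_csv` are made up for this example.

```python
import csv
import re
import zipfile
import xml.etree.ElementTree as ET

# XML namespace used by WordprocessingML elements inside a .docx archive.
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def docx_paragraphs(path):
    """Return the non-empty paragraph texts of a .docx file.

    A .docx file is a ZIP archive; the main body lives in word/document.xml.
    """
    with zipfile.ZipFile(path) as zf:
        root = ET.fromstring(zf.read("word/document.xml"))
    paragraphs = []
    for p in root.iter(W_NS + "p"):
        # A paragraph's text is split across one or more <w:t> runs.
        text = "".join(t.text or "" for t in p.iter(W_NS + "t")).strip()
        if text:
            paragraphs.append(text)
    return paragraphs


def extract_fields(paragraphs):
    """Collect 'Label: value' lines into a dict (assumed, hypothetical layout)."""
    fields = {}
    for line in paragraphs:
        match = re.match(r"^([^:]{1,40}):\s*(.+)$", line)
        if match:
            fields[match.group(1).strip()] = match.group(2).strip()
    return fields


def write_csv(rows, out_path, columns):
    """Write one row per document; missing columns are left blank."""
    with open(out_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=columns, extrasaction="ignore")
        writer.writeheader()
        writer.writerows(rows)
```

With this sketch, one would call `extract_fields(docx_paragraphs(path))` for each document, collect the resulting dicts in a list, and pass that list to `write_csv` together with the column names to keep. Real documents with tables, headers, or free-form text would need more careful parsing than this.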