Pattern Discovery of Sequential Symbolic Data using Automata with an application to Author Identification