This Python script parses an input text file (UTF-8 or the default codec) and searches for strings enclosed in "" or '' that start with a given prefix. For example, it lets you get a list of all attributes starting with "attr_" in your html.html, or all "roll_" names. The output is sorted alphabetically and written to output.txt; any existing output.txt in the same folder is overwritten.

No idea if anyone else has a use for this, but I needed it, and since I had written it anyway I thought I'd share it. The code is provided AS IS without any warranty and is released into the public domain. It might break under Linux or if a different default codec is set.

<a href="https://gist.github.com/NoxMortem/8525e8413c70d085" rel="nofollow">https://gist.github.com/NoxMortem/8525e8413c70d085</a>...

Example usage, with html.html copied to the same folder as parse.py:

python parse.py "html.html" "attr_"

Python 3 is available at <a href="https://www.python.org/" rel="nofollow">https://www.python.org/</a>. If your system does not recognize the python command, replace "python" with the full path, e.g. "C:\python\python.exe".
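For anyone who wants to see the idea without opening the gist, here is a rough sketch of how such a prefix search could work. This is not the actual parse.py: the function names (find_prefixed_names, read_text, extract_to_file) are made up for illustration, and trying UTF-8 first, dropping duplicates, and writing one name per line are all assumptions.

```python
import re


def find_prefixed_names(text, prefix):
    """Return the sorted quoted strings in text that start with prefix."""
    # Match an opening quote, the prefix, any run of non-quote characters,
    # and then the same kind of quote that opened the string.
    pattern = re.compile(r"""(["'])(""" + re.escape(prefix) + r"""[^"']*)\1""")
    # Assumption: duplicates are dropped. sorted() orders by code point,
    # so uppercase names come before lowercase ones.
    return sorted({match.group(2) for match in pattern.finditer(text)})


def read_text(path):
    """Read a file as UTF-8; on a decode error, retry with the default codec."""
    try:
        with open(path, encoding="utf-8") as handle:
            return handle.read()
    except UnicodeDecodeError:
        # Assumption: the platform default codec is the fallback.
        with open(path) as handle:
            return handle.read()


def extract_to_file(input_path, prefix, output_path="output.txt"):
    """Write the matching names to output_path, overwriting any existing file."""
    names = find_prefixed_names(read_text(input_path), prefix)
    with open(output_path, "w", encoding="utf-8") as handle:
        handle.write("\n".join(names))
    return names
```

With html.html in the current folder, calling extract_to_file("html.html", "attr_") would do roughly what the command line example does. Handling of quotes nested inside other quotes is deliberately left out to keep the sketch short.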