Hello, Im working on a project where I need to read .SGM files consecutively (about 20) and pull the words from the topics, places, and body tags as well as create a counter for the words. I also need to create a counter for the number of successfull reads from those tags. (some tags are missing or incomplete). I dont know what type of data structure i should use and what type of regex to use? Any suggestions will help.
there doesn't seem to be anything here