The problem description says we have to read the input from files, starting with index.htm, instead of from standard input.
On the contrary, the sample input suggests the files are given on standard input, each separated by a blank line. The relevant part of the statement:

The only HTML command you need to worry about is the HREF command, and you can assume that it will always be in the form <A HREF="filename">, with no additional spaces or other characters; that the name of the file is legal and in the same directory as the file you are already reading; and that the name of the file will not exceed twelve characters in length. Filenames will always end with ".htm".

(Understanding the above is NP-hard :) )

The initial HTML file you should start indexing will be named index.htm. Next the other files, including webpage.in, with a single blank line separating each listing. The words in webpage.in will be placed one word per line, with no additional spaces.
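For reference, the HREF rule itself looks simple enough. Here is a minimal sketch in Python of how I read it, assuming the tag always appears exactly as <A HREF="filename"> and the match is case-sensitive. The names HREF_RE and extract_links are my own, not from the problem statement.

```python
import re

# Matches only the exact form promised by the statement: <A HREF="filename">
# Assumption: uppercase tag, no extra spaces, filename ends with ".htm".
HREF_RE = re.compile(r'<A HREF="([^"]+\.htm)">')

def extract_links(text):
    """Return the linked filenames in the order they appear in text."""
    return HREF_RE.findall(text)

if __name__ == "__main__":
    sample = 'See <A HREF="layout.htm">the layout page.'
    print(extract_links(sample))
```

This only covers finding the links. It does not answer how each listing gets its file name, which is the part I am stuck on.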
Now, if I go with the second interpretation, the sample input confuses me, because there is no way to figure out the file name of each listing. See the second file listing, which is somehow supposed to be layout.htm.
What is the correct input and output format?
Please clarify.
-A