Hello! I run a law website and would like to convert them to a CMS. Of all the ones I've tinkered with, Joomla comes on top. However, I've come upon a really big snag and thats importing my old HTML pages to Joomla's content pages.
Of course, there's always cutting and pasting these pages using the "new content" function but believe me, this will take ages and ages to accomplish. If you're familiar with laws, supreme court cases and other legal documents you'll know that they're both verbose and numerous. To give you an idea, I'm working with over 50,000 individual HTML pages that have been encoded over the past four years. They also have numerous H1 to H5 tags that are used to number and classify the different sections of laws and cases. This covers laws and cases promulgated since 1901 to 2004. If you want to see the site its at
www.lawphil.net.
It seems that I might be stuck doing the whole copy-paste thing but 50K pages is, without a doubt, quite a herculean task. I've tried this component, site_import which doesn't really work for me. I can't figure out how it works and based on the description, it would probably rip my pages apart as it uses the H1 and H2 tags for sections and categories, respectively.
I've seen the posts on something similar to what I'm doing but so far I haven't seen anyone close to having the volume of pages I have. I was wondering (hoping actually) that someone might be able to help me. If it means anything, I've got consistent formatting among all the pages and mostly use just the H1 to H5, P, and BR tags and leave all the text formatting to a single CSS file. Please, please help me.
Thanks in advanced! I hope you can help me
