going from .doc to html

Posted: Fri Feb 01, 2008 10:51 am
by corwin
I am not computer savvy. Is there some way to convert stories posted as .doc files to HTML format? My computer won't let me read the stories posted in the general discussion that are in the .doc format.

Re: going from .doc to html

Posted: Fri Feb 01, 2008 11:22 am
by gesit
You'll need to have Microsoft Word installed on your PC.
And what do you mean by it won't let you read the .docs?
Good luck.

ham

Re: going from .doc to html

Posted: Fri Feb 01, 2008 12:14 pm
by Metatrone
WordPad could also do. But you don't have any kind of Office software on your computer? :shock:

Re: going from .doc to html

Posted: Fri Feb 01, 2008 2:43 pm
by Shadowhawk
There are many tools to either view MS Word .doc files (MS Word Viewer is one of them) or translate .doc to .html. Most office suites and word processors can read .doc files, for example OpenOffice.org, AbiWord (from GNOME Office), and KWord (from KDE's KOffice), and I think all of them have a "Save as HTML" option, or "HTML" as one of the options under "Save As". There are also tools that either display .doc files or translate them to plain text or HTML, like catdoc or word2x (both are CLI Linux tools).

I usually use KOConverter ('koconverter file.doc file.html') from KOffice to convert from .doc to .html (I use the "light" version, which can lose some not very important formatting); for Sennadar stories I also use the scripts mentioned on my Sennadar Wiki page, User:Shadowhawk/Scripts, for a final polish. Note that the CSS style files mentioned in the scripts are currently not available, sorry.
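
For a whole folder of chapters, the koconverter call quoted above could be wrapped in a small script. This is only a sketch: it assumes `koconverter` is installed and on the PATH, that it accepts the input and output file names exactly as in the command above, and the function names are made up for illustration.

```python
import subprocess
import sys
from pathlib import Path


def build_command(doc_path: Path) -> list[str]:
    """Return the call for one file: koconverter story.doc story.html."""
    return ["koconverter", str(doc_path), str(doc_path.with_suffix(".html"))]


def convert_folder(folder: Path) -> None:
    """Convert every .doc file directly inside `folder` (no subfolders)."""
    for doc_path in sorted(folder.glob("*.doc")):
        # check=True raises CalledProcessError if koconverter exits with an error.
        subprocess.run(build_command(doc_path), check=True)


if __name__ == "__main__" and len(sys.argv) == 2:
    # Usage: python convert_docs.py /path/to/stories
    convert_folder(Path(sys.argv[1]))
```

Each .html file is written next to its .doc source, so nothing is overwritten unless a file of that name already exists.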

Another solution is to find one of the free online services that convert .doc to .html. One option, if you have a Gmail account (I guess Google Docs would also suffice), is to send yourself the DOC or RTF file as an attachment, then choose "View as HTML" and save the result. I have used it with success on one RTF file that the other tools I have installed (KOffice, AbiWord) couldn't parse.

Re: going from .doc to html

Posted: Fri Feb 01, 2008 9:57 pm
by Fiferguy
I've had a lot of success using Word to "Save as a Web Page," then opening the file in Dreamweaver and "Cleaning Up Word HTML." It takes a lot of the non-W3C-compliant stuff out of the Word formatting. :wink:
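
For anyone without Dreamweaver, a rough idea of what such a cleanup does can be sketched with Python's standard library. This is not Dreamweaver's actual algorithm, just an illustration: the list of attributes to drop and the names `WordHtmlCleaner` and `clean_word_html` are assumptions, and a real cleanup would handle many more cases.

```python
from html import escape
from html.parser import HTMLParser

# Attributes Word tends to add for presentation; this list is an assumption.
DROPPED_ATTRS = {"style", "class", "lang"}


class WordHtmlCleaner(HTMLParser):
    """Re-emit HTML without Office-namespaced tags, comments, or styling attributes."""

    def __init__(self) -> None:
        super().__init__(convert_charrefs=True)
        self.parts: list[str] = []

    def handle_starttag(self, tag, attrs):
        if ":" in tag:  # e.g. <o:p>, an Office-specific element
            return
        kept = [(name, value) for name, value in attrs
                if name not in DROPPED_ATTRS and not name.startswith("xmlns")]
        rendered = "".join(
            f" {name}" if value is None else f' {name}="{escape(value, quote=True)}"'
            for name, value in kept
        )
        self.parts.append(f"<{tag}{rendered}>")

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> are written back as a plain start tag.
        self.handle_starttag(tag, attrs)

    def handle_endtag(self, tag):
        if ":" not in tag:
            self.parts.append(f"</{tag}>")

    def handle_data(self, data):
        self.parts.append(escape(data, quote=False))

    # Comments are dropped: handle_comment is left at its default no-op.


def clean_word_html(markup: str) -> str:
    parser = WordHtmlCleaner()
    parser.feed(markup)
    parser.close()  # flush any text still buffered by the parser
    return "".join(parser.parts)
```

For example, `clean_word_html('<p class="MsoNormal" style="margin:0">Hi<o:p></o:p></p>')` returns `<p>Hi</p>`.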

Re: going from .doc to html

Posted: Fri Feb 01, 2008 9:58 pm
by Fiferguy
I suddenly noticed my new icon... :twisted:

Re: going from .doc to html

Posted: Sat Feb 02, 2008 12:24 am
by Spec8472
Fiferguy wrote:I suddenly noticed my new icon... :twisted:
You mean you didn't put it there? I didn't (no, really).

Re: going from .doc to html

Posted: Sat Feb 02, 2008 1:37 am
by Fiferguy
Spec8472 wrote:
Fiferguy wrote:I suddenly noticed my new icon... :twisted:
You mean you didn't put it there? I didn't (no, really).
If I'd put it there, it would say "I WILL insult Spec8472." :twisted: :mrgreen: 8) :mrgreen: :twisted:

Re: going from .doc to html

Posted: Sat Feb 02, 2008 4:46 am
by Deeb
Image


*whistles innocently* :shock:

Re: going from .doc to html

Posted: Sat Feb 02, 2008 4:58 am
by Fiferguy
MUCH better... thanks Deeb... :twisted: :twisted: :twisted: :twisted: :twisted:

Re: going from .doc to html

Posted: Thu Feb 07, 2008 6:10 am
by Greymist
It may or may not have been me who did it. I was amused to see the replaced version with the strikethrough on the "not", though.

Re: going from .doc to html

Posted: Wed Dec 17, 2008 9:12 am
by Bartokian
This might sound stupid - but I am having a similar problem over here...

While I can read the chapters coming out of the announcement forum perfectly well (so, .doc files), I cannot read the .doc files downloaded from the wp-blog (Books section). First, it asks about the macros, but no matter what I answer to that one, my computer refuses to open the files. It claims that either the filename is bad (renaming it doesn't help) or that it cannot find the file (hey, I did open it, right?). How are these files made? Maybe this is because I am working on a Mac with Office 2008. It could very well be that the macros (Visual Basic) are just not supported on my platform. Would it be feasible to also post all the individual chapters, as they can be found on the forum, in one central location on the forum? Just to keep things compatible for everyone out there...

In the meantime, I will just have to be happy with the forum :D

Re: going from .doc to html

Posted: Wed Dec 17, 2008 4:59 pm
by expedient
Using Mac OS X:

1. Open TextEdit

2. Open Preferences (cmd-,) and set HTML Saving Options in the Open and Save tab to:
a) Document type: HTML 4.01 Strict
b) Styling: No CSS
c) Encoding: Unicode (UTF-8)
d) Check Preserve White Space
Close preferences
3. Open the Word file in TextEdit

4. Save As... (shift-cmd-s), File Format: HTML

These settings will ensure a very clean, single-file HTML document. If you wish to preserve font type settings then you can use Embedded or Inline CSS.
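
With many files, the same kind of conversion can probably be scripted instead of clicked through. The sketch below is an assumption, not part of the steps above: it relies on `textutil`, a command-line converter that ships with Mac OS X, and it does not apply the TextEdit preferences listed here, so the resulting HTML may be styled differently. The function name is made up for illustration.

```python
import subprocess
import sys
from pathlib import Path


def textutil_command(doc_path: Path) -> list[str]:
    """Build a textutil call that writes an .html file next to the source."""
    html_path = doc_path.with_suffix(".html")
    return ["textutil", "-convert", "html",
            "-output", str(html_path), str(doc_path)]


if __name__ == "__main__" and len(sys.argv) == 2:
    # Usage (on a Mac): python doc_to_html.py chapter1.doc
    # check=True raises CalledProcessError if textutil exits with an error.
    subprocess.run(textutil_command(Path(sys.argv[1])), check=True)
```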

Hope this helps.

Re: going from .doc to html

Posted: Thu Dec 18, 2008 2:54 pm
by Bartokian
Thanks Expedient,

This little trick does indeed let me read the compounded files. I found out I could even save them as docx files, which I was able to open in Office!
It seems that the VB macros (which are removed in this manner) are the underlying cause of our troubles...

Mille grazie, :o