Please be aware that there are hundreds more characters in a DOC file that need conversion to HTML other than italics and bold text.
The Wc3 have a page of recommended convertor files here:
http://www.w3.org/Tools/Word_proc_filters.html
And here you can find the actual characters and their HTML equivalents:
http://www.w3schools.com/tags/ref_entities.asp
Some examples:
Reserved Characters in HTML
Some characters are reserved in HTML and XHTML. For example, you cannot use the greater than or less than signs within your text because the browser could mistake them for markup.
HTML and XHTML processors must support the five special characters listed in the table below:
Character Entity Number Entity Name Description
" " " quotation mark
' ' ' (does not work in IE) apostrophe
& & & ampersand
< < < less-than
> > > greater-than
Note: Entity names are case sensitive!
ISO 8859-1 Symbols
Character Entity Number Entity Name Description
  non-breaking space
¡ ¡ ¡ inverted exclamation mark
¢ ¢ ¢ cent
£ £ £ pound
¤ ¤ ¤ currency
¥ ¥ ¥ yen
¦ ¦ ¦ broken vertical bar
§ § § section
¨ ¨ ¨ spacing diaeresis
© © © copyright
ª ª ª feminine ordinal indicator
« « « angle quotation mark (left)
¬ ¬ ¬ negation
­ ­ soft hyphen
® ® ® registered trademark
¯ ¯ ¯ spacing macron
° ° ° degree
± ± ± plus-or-minus
² ² ² superscript 2
³ ³ ³ superscript 3
´ ´ ´ spacing acute
µ µ µ micro
¶ ¶ ¶ paragraph
· · · middle dot
¸ ¸ ¸ spacing cedilla
¹ ¹ ¹ superscript 1
º º º masculine ordinal indicator
» » » angle quotation mark (right)
¼ ¼ ¼ fraction 1/4
½ ½ ½ fraction 1/2
¾ ¾ ¾ fraction 3/4
¿ ¿ ¿ inverted question mark
× × × multiplication
÷ ÷ ÷ division
ISO 8859-1 Characters
Character Entity Number Entity Name Description
À À À capital a, grave accent
Á Á Á capital a, acute accent
   capital a, circumflex accent
à à à capital a, tilde
Ä Ä Ä capital a, umlaut mark
Å Å Å capital a, ring
Æ Æ Æ capital ae
Ç Ç Ç capital c, cedilla
È È È capital e, grave accent
É É É capital e, acute accent
Ê Ê Ê capital e, circumflex accent
Ë Ë Ë capital e, umlaut mark
Ì Ì Ì capital i, grave accent
Í Í Í capital i, acute accent
Î Î Î capital i, circumflex accent
Ï Ï Ï capital i, umlaut mark
Ð Ð Ð capital eth, Icelandic
Ñ Ñ Ñ capital n, tilde
Ò Ò Ò capital o, grave accent
Ó Ó Ó capital o, acute accent
Ô Ô Ô capital o, circumflex accent
Õ Õ Õ capital o, tilde
Ö Ö Ö capital o, umlaut mark
Ø Ø Ø capital o, slash
Ù Ù Ù capital u, grave accent
Ú Ú Ú capital u, acute accent
Û Û Û capital u, circumflex accent
Ü Ü Ü capital u, umlaut mark
Ý Ý Ý capital y, acute accent
Þ Þ Þ capital THORN, Icelandic
ß ß ß small sharp s, German
à à à small a, grave accent
á á á small a, acute accent
â â â small a, circumflex accent
ã ã ã small a, tilde
ä ä ä small a, umlaut mark
å å å small a, ring
æ æ æ small ae
ç ç ç small c, cedilla
è è è small e, grave accent
é é é small e, acute accent
ê ê ê small e, circumflex accent
ë ë ë small e, umlaut mark
ì ì ì small i, grave accent
í í í small i, acute accent
î î î small i, circumflex accent
ï ï ï small i, umlaut mark
ð ð ð small eth, Icelandic
ñ ñ ñ small n, tilde
ò ò ò small o, grave accent
ó ó ó small o, acute accent
ô ô ô small o, circumflex accent
õ õ õ small o, tilde
ö ö ö small o, umlaut mark
ø ø ø small o, slash
ù ù ù small u, grave accent
ú ú ú small u, acute accent
û û û small u, circumflex accent
ü ü ü small u, umlaut mark
ý ý ý small y, acute accent
þ þ þ small thorn, Icelandic
ÿ ÿ ÿ small y, umlaut mark