Converting Doc file to XML & read the XML

  • verma rahul wrote (#1):

    I am creating a web application that needs to convert document files into XML and then read the XML files for specific words in a specific format. I am using Microsoft.Office.Interop to convert the document files to XML. The files are generated, but they carry a lot of formatting information, which makes them very large. I need help writing code that reduces the XML files by removing the unwanted document formatting, while preserving it where required. Thanks in advance.

    Learner always

      minhpc_bk wrote (#2):

      To get rid of the document formatting, you may consider generating the XML document on your own. To do that, you can use Automation or a third-party component (a better option, IMO) to access the Word document, and then use XSL to transform the data into a light XML file.
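
The "generate the XML on your own" idea can be sketched roughly as follows. This is only an illustration under several assumptions, not the method either poster used: it is written in Python with the standard library instead of XSL, it assumes the Word-generated XML stores paragraphs in elements whose local name is `p` and text runs in elements whose local name is `t` (as WordprocessingML does), and the output element names `document` and `para`, the helper names, and the `SAMPLE` input are all made up for the example. The same walk-and-copy approach should carry over to an XML API in .NET.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily shortened stand-in for the XML that Word writes.
SAMPLE = """<w:document xmlns:w="http://schemas.openxmlformats.org/wordprocessingml/2006/main">
  <w:body>
    <w:p><w:pPr><w:jc w:val="center"/></w:pPr>
      <w:r><w:rPr><w:b/></w:rPr><w:t>Invoice </w:t></w:r>
      <w:r><w:t>total</w:t></w:r>
    </w:p>
    <w:p><w:r><w:t>Thank you</w:t></w:r></w:p>
  </w:body>
</w:document>"""


def local_name(tag):
    """Return the tag name without its '{namespace}' prefix."""
    return tag.rsplit("}", 1)[-1]


def to_light_xml(word_xml):
    """Build a small <document> tree that keeps only paragraph text."""
    source = ET.fromstring(word_xml)
    light = ET.Element("document")
    for node in source.iter():
        if local_name(node.tag) != "p":
            continue
        # Join the text of every run inside this paragraph; formatting
        # elements (pPr, rPr, ...) are simply never copied.
        text = "".join(
            t.text or "" for t in node.iter() if local_name(t.tag) == "t"
        )
        if text.strip():
            ET.SubElement(light, "para").text = text
    return light


def find_word(light, word):
    """Return 1-based indexes of paragraphs containing the word (case-insensitive)."""
    return [
        index
        for index, para in enumerate(light.findall("para"), start=1)
        if word.lower() in (para.text or "").lower()
    ]


light = to_light_xml(SAMPLE)
print(ET.tostring(light, encoding="unicode"))
print(find_word(light, "total"))
```

Matching on local names keeps the sketch independent of which Word XML namespace the file uses, at the cost of also matching same-named elements from other namespaces; if some formatting must be preserved, the relevant property could be copied onto `para` as an attribute instead of being dropped.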
