Performance issue in XSL processing

11 Posts 4 Posters 0 Views 1 Watching
    praveen3g wrote (#1):

    Hi all, it feels good to be part of this forum of technical stalwarts and gurus, and I hope to find a solution here to a long-standing problem. We are facing performance issues when processing large XML files. Using the Java API TransformerFactory in interpretive mode, it takes an unbelievable 6 days to process huge files, which is a very serious concern in a production system. Please help me out, and let me know what I should attach that would be helpful for the analysis.

      Lost User wrote (#2):

      praveen3g wrote:

      takes unbelievable 6 days for processing huge files

      Most people would have figured out that something was wrong before this. Take a look at the data you are processing and how you are dealing with it. The chances of anyone being able to guess what is wrong are not very high.

      speaking as ...

        praveen3g wrote (#3):

        Hi, thanks for the reply. The answer is no: the people I have checked with were not able to diagnose where the problem is or how to work around it. Everyone understands that the volume of data in the file is huge, say a 100 MB XML document, but no one to date has offered a way to handle it efficiently. Let me know if you wish to have a look; I look forward to your suggestions.

          Lost User wrote (#4):

          This is not the sort of issue that can be resolved in a forum like this. You need to analyse your code and measure your processing time and code paths. Try running against some small data files, see how long each one takes, and work out whether that is an acceptable length of time.

          speaking as ...
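
The measuring step suggested in #4 could be sketched roughly as follows. This is only an illustration under stated assumptions: the stylesheet `XSL`, the generated `<items>` input, and the class and method names are hypothetical stand-ins, not the original poster's code. It uses the standard JAXP `TransformerFactory`, compiles the stylesheet once, and times the same transform on inputs of growing size. If the time grows much faster than the input does, the stylesheet (for example, a repeated `//` search inside a template) is a more likely suspect than the file size alone.

```java
import javax.xml.transform.Templates;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;
import java.io.StringReader;
import java.io.StringWriter;

class TransformTiming {
    // Hypothetical stand-in stylesheet: turns each <item> into a <row>.
    static final String XSL =
          "<xsl:stylesheet version='1.0' xmlns:xsl='http://www.w3.org/1999/XSL/Transform'>"
        + "<xsl:output method='xml' omit-xml-declaration='yes'/>"
        + "<xsl:template match='/items'><rows><xsl:apply-templates select='item'/></rows></xsl:template>"
        + "<xsl:template match='item'><row id='{@id}'/></xsl:template>"
        + "</xsl:stylesheet>";

    // Compile the stylesheet once; the Templates object can be reused for every run.
    static Templates compile() {
        try {
            return TransformerFactory.newInstance()
                    .newTemplates(new StreamSource(new StringReader(XSL)));
        } catch (TransformerException e) {
            throw new IllegalStateException("stylesheet failed to compile", e);
        }
    }

    // Builds a synthetic test document with n <item> elements.
    static String sampleXml(int n) {
        StringBuilder sb = new StringBuilder("<items>");
        for (int i = 0; i < n; i++) {
            sb.append("<item id='").append(i).append("'/>");
        }
        return sb.append("</items>").toString();
    }

    // Runs one transformation and returns the serialized result.
    static String transform(Templates templates, String xml) {
        try {
            StringWriter out = new StringWriter();
            templates.newTransformer()
                     .transform(new StreamSource(new StringReader(xml)), new StreamResult(out));
            return out.toString();
        } catch (TransformerException e) {
            throw new IllegalStateException("transformation failed", e);
        }
    }

    public static void main(String[] args) {
        Templates templates = compile();
        // Grow the input tenfold each round and watch how the elapsed time scales.
        for (int n : new int[] {1_000, 10_000, 100_000}) {
            String xml = sampleXml(n);
            long start = System.nanoTime();
            transform(templates, xml);
            long millis = (System.nanoTime() - start) / 1_000_000;
            System.out.printf("%d items: %d ms%n", n, millis);
        }
    }
}
```

The first round includes JIT warm-up, so the later rounds are the more meaningful ones to compare.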

            praveen3g wrote (#5):

            I have done my analysis, and I guess you do not actually understand what the issue is. Please understand that I am not a kindergarten student who just posts a question and expects a ready-made answer; I have done quite a lot of research on this, more than you. I will look for someone who can help me with which parameters to look at, rather than replying generically. Please do not bother to reply further. Thanks again.

              Lost User wrote (#6):

              praveen3g wrote:

              you do not understand actually what the issue is.

              Yes, I understand perfectly: your code runs slowly and you want to know why.

              praveen3g wrote:

              I will look for someone who can just help me out on what are the different parameters that can be looked for rather than just replying generic.

              Given the information you have provided, how can anyone guess what those parameters are?

              speaking as ...

                praveen3g wrote (#7):

                I asked specifically what information I should post here for your analysis, as I don't know which logs, JVM settings, or XSL snippets (the ones taking the most memory and time) you may require. If you had asked me for any of these details, or whatever else you think is required, I would have provided them in the first place.

                  Lost User wrote (#8):

                  praveen3g wrote:

                  you may require for your analysis.

                  Like most people on this forum, I do not have the time or resources to analyse an issue like this. As I said earlier this question cannot be answered in a forum such as this. You need to do the analysis and ask a more specific and detailed question before anyone can hope to offer any suggestions.

                  speaking as ...

                    praveen3g wrote (#9):

                    I have done my part of the analysis and even know where the time is going. I am left with two options:
                    1) Split the large XML file into smaller chunk files based on a condition.
                    2) Increase the DTM threads in the Java API so that the compiled version of Xalan can be used.
                    Let me know if you have any ideas on either option; I can then ask to-the-point questions on the option you suggest.
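
Option 1 could look something like the sketch below. It makes assumptions the thread does not confirm: that the large document is a flat sequence of independent record elements under one root, and that each record can be transformed without seeing its siblings. The element name `order`, the class `XmlChunker`, and the method `split` are hypothetical. The sketch uses the standard StAX `XMLEventReader` to stream through the input and hands each record subtree to a callback as its own small document, so the whole file never has to sit in memory as one tree.

```java
import javax.xml.stream.XMLEventReader;
import javax.xml.stream.XMLEventWriter;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLOutputFactory;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.events.XMLEvent;
import java.io.Reader;
import java.io.StringReader;
import java.io.StringWriter;
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

class XmlChunker {
    // Streams the input and passes every element named recordName (with its
    // whole subtree) to the sink as a separate XML string.
    static void split(Reader input, String recordName, Consumer<String> sink) {
        try {
            XMLEventReader reader = XMLInputFactory.newInstance().createXMLEventReader(input);
            XMLOutputFactory outputFactory = XMLOutputFactory.newInstance();
            while (reader.hasNext()) {
                XMLEvent event = reader.nextEvent();
                if (!event.isStartElement()
                        || !event.asStartElement().getName().getLocalPart().equals(recordName)) {
                    continue; // skip everything outside a record
                }
                StringWriter buffer = new StringWriter();
                XMLEventWriter writer = outputFactory.createXMLEventWriter(buffer);
                writer.add(event);
                int depth = 1; // nesting depth inside the current record
                while (depth > 0) {
                    XMLEvent inner = reader.nextEvent();
                    if (inner.isStartElement()) depth++;
                    if (inner.isEndElement()) depth--;
                    writer.add(inner);
                }
                writer.flush();
                writer.close();
                sink.accept(buffer.toString());
            }
            reader.close();
        } catch (XMLStreamException e) {
            throw new IllegalStateException("could not split XML input", e);
        }
    }

    public static void main(String[] args) {
        // Tiny stand-in for the real 100 MB file.
        String xml = "<orders>"
                   + "<order id='1'><name>widget-1</name></order>"
                   + "<order id='2'><name>widget-2</name></order>"
                   + "</orders>";
        List<String> chunks = new ArrayList<>();
        split(new StringReader(xml), "order", chunks::add);
        chunks.forEach(System.out::println);
    }
}
```

In a real run the sink would transform or write out each chunk immediately instead of collecting them in a list. If the stylesheet needs data from other records (totals, cross-references), this kind of split changes the result and would not apply as written.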

                      Sasha Laurel wrote (#10):

                      Have you tried using a profiler? A colleague of mine wrote an XML parser/validator in .NET for large files, and the first file would complete in about 20-30 minutes. After the first, though, it would get incredibly slow, taking around 18-30 hours to process 50 MB files. When we checked the memory allocations, there were about 10 times more than necessary. The app was running out of memory and continually hitting the page file on disk to make up the difference. All of that because he didn't understand the immutable nature of .NET strings. EDIT: I realize you are not using .NET, but I am thinking you could try a profiler for Java. That is the best advice I can give. Good luck, friend. :)
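
Java strings are immutable as well, so the allocation pattern described in #10 can occur there too. The sketch below is a generic illustration, not the colleague's code or the original poster's; the class and method names are made up. It contrasts building a large string with `+=` in a loop, which creates a new `String` and copies everything accumulated so far on each pass, with `StringBuilder`, which appends into one growable buffer.

```java
class ConcatDemo {
    // Each += allocates a fresh String and copies the text built so far,
    // so the total copying grows quadratically with the number of parts.
    static String joinWithConcat(String[] parts) {
        String result = "";
        for (String part : parts) {
            result += part;
        }
        return result;
    }

    // StringBuilder appends into a single buffer that grows as needed.
    static String joinWithBuilder(String[] parts) {
        StringBuilder sb = new StringBuilder();
        for (String part : parts) {
            sb.append(part);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String[] parts = new String[20_000];
        java.util.Arrays.fill(parts, "<row/>");

        long start = System.nanoTime();
        joinWithConcat(parts);
        long concatMillis = (System.nanoTime() - start) / 1_000_000;

        start = System.nanoTime();
        joinWithBuilder(parts);
        long builderMillis = (System.nanoTime() - start) / 1_000_000;

        System.out.printf("concat: %d ms, builder: %d ms%n", concatMillis, builderMillis);
    }
}
```

A profiler's allocation view is what would show whether a pattern like this is actually present in a given application; the timing printout here only makes the difference visible on a toy input.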

                        jinzai wrote (#11):

                        I feel for this poster. The issue is most likely poor XML and XSLT design. Too many people refuse to take the time to understand their data and XML, and in the rush to 'get something working' they create a schema that is bloated, incomplete, and requires a lot of resources to even attempt to use. Six days for 100 MB? Yes, I can say that the schema does not represent the data well, and the XSL is probably even worse, because it must translate this rickety XML into something that might be even more poorly designed and implemented. It is apparent that the XSL is where all of the changes and fixes have been applied.

                        For starters, do not avoid using attributes simply because elements seem 'easier'; they are not. Things that describe a thing are attributes. Things that own things are usually elements.

                        I post here because when you try to engineer a system and write the entire application at the same time, all the while saying 'just get it running, we'll fill in the details later', and you also take a lot of shortcuts ('it will be easier if we do ....'), all you are doing is moving the work further down the line and making it more difficult. There is a definite amount of work that must be done: never assume that you can avoid it, and do as much up front as you can.
