Load time - ODBC vs text file
Posted: Sat May 07, 2022 11:49 pm
Just wondering if there is any known performance hit when loading from an ODBC source vs text. Having just implemented HANA, we are now moving away from relying on text file loads and towards querying BW on HANA directly from TM1 (yes, we are moving out of the dark ages). I started off with a simple read with a count accumulation in the Data tab and an export of the total count in the Epilog. That was fast - no issues.
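For reference, the read test was roughly along these lines. This is a minimal sketch rather than the exact process: the variable name and the output file name are placeholders, and the data source is assumed to be the same ODBC query used by the real load.

```
# --- Prolog tab ---
# Initialise the row counter before any source rows are read
nCount = 0;

# --- Data tab (executed once per source row) ---
# No cube writes or lookups, just count the row
nCount = nCount + 1;

# --- Epilog tab ---
# Write the total to a text file; 'odbc_read_count.txt' is a placeholder name
ASCIIOutput('odbc_read_count.txt', NumberToString(nCount));
```

Since this does nothing but fetch rows, its run time should be a fair indication of how long the ODBC read itself takes.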
However, having redeveloped the existing load process (no fundamental changes) to use the ODBC data source, the load time is about double what it was from text. There are approximately 15 million rows and the load takes 1 hr 10 mins. I have looked at the HANA connection property options but don't see anything obvious that might be causing an issue and, as I say, a simple read with a count accumulation is very fast - just a few minutes. I know most people don't deal with these volumes, so maybe I am experiencing something most wouldn't notice.
As a general rule should we expect ODBC sourced loads to be so much slower? If so, any suggestion as to why?
When I performed the original data read test I also tested a parallel read using RunProcess. I found no material difference in the total read time. I don't know if that is down to TM1 not being able to perform concurrent ODBC sourced loads or to the HANA driver. I have not tried TM1RunTI.
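The parallel test was set up along these lines. Again this is only a sketch: the process name 'Load.From.ODBC', the parameter 'pPeriod' and its values are made-up placeholders, and it assumes the child process uses the parameter to restrict its ODBC query to one slice of the data.

```
# --- Prolog of a hypothetical controller process ---
# Each RunProcess call starts the child process on its own thread and
# returns without waiting for it to finish, so the three loads are
# intended to run side by side.
RunProcess('Load.From.ODBC', 'pPeriod', '2022-01');
RunProcess('Load.From.ODBC', 'pPeriod', '2022-02');
RunProcess('Load.From.ODBC', 'pPeriod', '2022-03');
```

Because the controller does not wait for the children, the total read time has to be taken from when the last child finishes, not from when the controller ends.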
Should I expect to be able to perform parallel ODBC loads with performance gains? If so what about with HANA?