Does Spark-HPCC Connector support child datasets?

Description

If so, is this the same as "nested tables"?  Does it also support SQL syntax for nested tables?

Conclusion

None

Activity

Show:

James Wiltshire February 3, 2021 at 9:31 PM
Edited

Any idea how hard it would be to update WsSQL logic to allow queries against files with child datasets?

I believe my client would care less about being able to filter against child dataset rows/fields - but would probably be happy with my option 1, 2, or 3 above.

 

 

(edit...)

Rough analysis of the code that disallows child datasets in WsSQL...

When a query is sent to WsSQL for a file with child datasets, the esp.log shows this error message:

Nested data files not supported

https://github.com/hpcc-systems/HPCC-Platform/blob/c4487f7e71a37df7db20b3f3a6f0d985f6e5f2fc/esp/services/ws_sql/SQL2ECL/HPCCFileCache.cpp

...where the "containsNestedColumns" value is originally set here:

https://github.com/hpcc-systems/HPCC-Platform/blob/c4487f7e71a37df7db20b3f3a6f0d985f6e5f2fc/esp/services/ws_sql/SQL2ECL/HPCCFile.cpp

 

 

 

 

Rodrigo Pastrana February 3, 2021 at 6:40 PM

reopen if more info is needed.

James McMullan February 3, 2021 at 6:24 PM
Edited

 The Spark HPCC Connector does allow for reading / writing datasets with child datasets. This will automatically be handled and converted into the appropriate types for the Spark Dataframe.

James Wiltshire February 2, 2021 at 8:44 PM

WsSQL appears to NOT support child datasets in a select.

I.e., it fails doing a SELECT (even one that does NOT explicitly reference a child dataset field) against a logical file (table) that has child datasets.

 

Ways I can imagine to possibly "partially support" child datasets in WsSQL or even Spark HPCC Connector - allow SELECTs against these tables, but:

  1. Include child dataset fields as Xml text in a single string field.

  2. Include an empty field where the child dataset is.  (With a warning in the logs?)

  3. Exclude the child data field.  (With a warning in the logs?)

 

Fixed
Pinned fields
Click on the next to a field label to start pinning.

Details

Components

Assignee

Reporter

Priority

Fix versions

Pull Request URL

Created January 29, 2021 at 12:36 AM
Updated April 15, 2021 at 4:07 PM
Resolved April 15, 2021 at 4:06 PM