- Community Home
- >
- Software
- >
- HPE Ezmeral Software platform
- >
- Connect Unified Analytics (Presto) to Ezmeral Data...
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Discussions
Forums
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО06-12-2024 09:52 AM
тАО06-12-2024 09:52 AM
Connect Unified Analytics (Presto) to Ezmeral Data Fabric using Hive Parquet connector
Hi, I'm wondering what I'm doing wrong. I got a big CSV and converted it to parquet using Python Pandas and Numpy. Then I upload it to Object Storage on Data Fabric and read it using Hive Connector from Unified Analytics. The problem is... when I use a smal sample of file it works well but when I try with a larger sample (but not too large... around 100 MB) I got this problem:
Query failed (#20240612_164659_00147_ekmiu): can not read class org.apache.parquet.format.PageHeader: Required field 'uncompressed_page_size' was not found in serialized data! Struct: org.apache.parquet.format.PageHeader$PageHeaderStandardScheme@30b217e0
Always this 'uncompressed_page_size' stuff. I'm wondering if I need to customize something in the Presto Pod configuration.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО06-12-2024 10:53 AM - edited тАО09-16-2024 02:19 AM
тАО06-12-2024 10:53 AM - edited тАО09-16-2024 02:19 AM
Query: Connect Unified Analytics (Presto) to Ezmeral Data Fabric using Hive Parquet connector
System recommended content:
1. HPE Ezmeral Unified Analytics Software 1.2 Documentation | Hive Connection Parameters
2. HPE Ezmeral Unified Analytics Software 1.3 Documentation | Hive Connection Parameters
Please click on "Thumbs Up/Kudo" icon to give a "Kudo".
Thank you for being a HPE valuable community member.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО06-13-2024 04:30 AM - last edited on тАО06-13-2024 05:36 AM by Sunitha_Mod
тАО06-13-2024 04:30 AM - last edited on тАО06-13-2024 05:36 AM by Sunitha_Mod
Re: Connect Unified Analytics (Presto) to Ezmeral Data Fabric using Hive Parquet connector
Connecting Unified Analytics Presto to Ezmeral Data Fabric involves configuring Presto to access data stored in Ezmeral's distributed file system. This integration enables advanced analytics on large datasets, leveraging Presto's SQL querying capabilities with Ezmeral's robust data management. Follow the connection setup guide, configure the necessary connectors, and ensure proper authentication to facilitate seamless and efficient data analysis across the platforms.