We are looking for part-time help from a Big Data architect that has experience with Cloudera, Hive, Tableau, and AWS.
Currently we have Tableau Desktop running on AWS that is trying to connect to Hive sitting on a small Cloudera cluster. We are trying to run some aggregations on a 5 GB CSV file that is sitting on HDFS within Cloudera, but the reports are taking a long time.
We want this person to help us performance tune this issue for us asap.
2+ years of experience with HIve, Cloudera, Big Data architecture
Good experience in connecting Tableau to data sources sitting on AWS
Past experience in performance tuning issues such as the one explained above