How to connect HBase using Apache Phoenix from Pentaho Kettle

In Our Office Mustafa needs to connect HBase from Pentaho Kettle. We find a solution for the problem. I want to share who need this.

  1. Download suitable  Apache Phoenix version for you from the website: http://phoenix.apache.org/download.html
  2. Copy two files from source directory to: PENTAHO_INSTALL_PATH/lib/ phoenix-core-4.3.1.jar phoenix-4.3.1-client.jar
  3. Create a new project in Pentaho: File -> New -> Transformation
  4. From left pane select **Design -> Input -> Table Input **and drag it to your transformation
  5. Double click to your table input and give a name to your step
  6. Click new next to Connection select box to create a new database connection
  7. Give your connection a name (Ex: Phoenix)
    Connection Type: Generic Database
    Access: Native (JDBC)
    Custom Connection URL: Your ZooKeeper Hosts (Ex: jdbc:phoenix:localhost:2181:/hbase)
    Custom Driver Class Name: org.apache.phoenix.jdbc.PhoenixDriver
    And then click Ok to close database connection settings popup

Thanks to Mustafa Artuc

image

10 Haziran 2015

Posted In: apache hbase, apache phoenix, hbase, pentaho, pentaho kettle, phoenix

The replication is turned off HBase, why is oldWALs directory still holded ?

Disabled replication will still hold on to the WAL files because, because it has a guarantee to not lose data between disable and enable. You can remove_peer, which frees up the WAL files to be eligible for deletion. When you re-add replication peer again, the replication will start from the current status, versus if you re-enable a peer, it will continue from where it left.

12 Mart 2015

Posted In: apache hbase, hbase, replication, WAL

WP Twitter Auto Publish Powered By : XYZScripts.com