Connect to Impala from an Ubuntu VM using a Kerberos keytab
On the client machine (Ubuntu VM):
1) Download and build the unixODBC driver manager:
http://www.unixodbc.org/unixODBC-2.3.4.tar.gz
tar -xzf unixODBC-2.3.4.tar.gz
cd unixODBC-2.3.4
./configure
make
make install

2) Install the Cloudera Impala ODBC driver (clouderaimpalaodbc_2.5.32.1002-2_amd64.deb):
http://www.cloudera.com/downloads/connectors/impala/odbc/2-5-32.html
dpkg -i clouderaimpalaodbc_2.5.32.1002-2_amd64.deb

3) Copy /etc/odbc.ini and /etc/odbcinst.ini to the home directory (/home/user_impala).

4) cat /home/user_impala/odbc.ini
[ODBC Data Sources]
Impala_DSN=Cloudera Impala ODBC Driver 64-bit

[Impala_DSN]
Driver=/opt/cloudera/impalaodbc/lib/64/libclouderaimpalaodbc64.so
HOST=Impala.dev.Server.com
Port=21050
Database=iris
AuthMech=1
KrbRealm=DEV.SERVER.COM
KrbFQDN=Impala.dev.Server.com
KrbServiceName=impala
UID=user_impala
UseKeytab=1
#UPNKeytabMappingFile=/home/user_impala/service.keytab
DefaultKeytabFile=/home/user_impala/service.keytab

5) cat /home/user_impala/odbcinst.ini
[ODBC Drivers]
Cloudera Impala ODBC Driver 64-bit=Installed
UsageCount=2

[Impala_ODBC_Driver]
Description=Cloudera Impala ODBC Driver (64-bit)
Driver=/opt/cloudera/impalaodbc/lib/64/libclouderaimpalaodbc64.so
UsageCount=2

6) Place the krb5.conf for user_impala (copied from the Impala server) in /etc:
sudo cp krb5.conf /etc
sudo chmod 755 /etc/krb5*
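
Before going further it can help to confirm that the keytab and krb5.conf work on their own, outside of ODBC. The following is only a sketch: the principal user_impala@DEV.SERVER.COM is an assumption pieced together from the UID and KrbRealm values above, the helper name verify_keytab is made up for illustration, and it assumes the MIT Kerberos client tools (kinit, klist) are installed.

```shell
# Values taken from the odbc.ini above; PRINCIPAL is an assumption (UID + KrbRealm).
KEYTAB=/home/user_impala/service.keytab
PRINCIPAL=user_impala@DEV.SERVER.COM

# Hypothetical helper: list the keytab entries, then try to get a ticket from it.
verify_keytab() {
    keytab=$1
    principal=$2
    # kinit/klist come from the MIT Kerberos client tools
    command -v kinit >/dev/null 2>&1 || { echo "kinit not found" >&2; return 2; }
    [ -r "$keytab" ] || { echo "keytab not readable: $keytab" >&2; return 1; }
    klist -k -t "$keytab" &&            # show principals stored in the keytab
    kinit -k -t "$keytab" "$principal" &&  # request a ticket using the keytab
    klist                               # show the ticket cache
}

verify_keytab "$KEYTAB" "$PRINCIPAL" || echo "keytab check failed" >&2
```

If the principal listed by klist -k -t differs from the assumed one, use the listed name instead.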

7) Export the environment variables:
export ODBCINI=/home/user_impala/odbc.ini
export ODBCSYSINI=/home/user_impala/odbcinst.ini
export CLOUDERAIMPALAODBCINI=/opt/cloudera/impalaodbc/lib/64/cloudera.impalaodbc.ini
export LD_PRELOAD=/usr/lib/libodbcinst.so
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/lib:/opt/cloudera/impalaodbc/lib/64:.
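
A typo in any of these paths tends to show up later as an unhelpful isql error, so a quick check that each exported file is readable can save time. This is a minimal sketch; check_readable and odbc_preflight are hypothetical helper names, not part of unixODBC or the Cloudera driver.

```shell
# Hypothetical helper: report whether the file behind a setting exists and is readable.
check_readable() {
    label=$1
    path=$2
    if [ -r "$path" ]; then
        echo "ok: $label -> $path"
    else
        echo "missing: $label -> $path"
        return 1
    fi
}

# Hypothetical helper: check every file-valued variable exported above.
odbc_preflight() {
    status=0
    check_readable ODBCINI "${ODBCINI:-}" || status=1
    check_readable ODBCSYSINI "${ODBCSYSINI:-}" || status=1
    check_readable CLOUDERAIMPALAODBCINI "${CLOUDERAIMPALAODBCINI:-}" || status=1
    check_readable LD_PRELOAD "${LD_PRELOAD:-}" || status=1
    return $status
}

odbc_preflight || echo "fix the paths marked missing before running isql"
```

Run it in the same shell session as the exports, since the variables are not visible elsewhere.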


8) Done. Test the connection:
 $ isql -v Impala_DSN

+---------------------------------------+
| Connected! 
| sql-statement  
| help [tablename]
| quit
+---------------------------------------+
SQL> show tables;
+---------------------------------------------------------------------
| name    
+---------------------------------------------------------------------
| commodity_data            
| ml_data 
| mt_data_base  
| mdl_data_inc 
| processed_data_sample 
+---------------------------------------------------------------------
SQLRowCount returns -1
19 rows fetched
SQL>
^Z
[1]+  Stopped
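
The same DSN can also be queried without the interactive prompt by piping a statement into isql in batch mode (-b). A small sketch, where run_query is a hypothetical wrapper name and the statement is the one from the session above:

```shell
# Hypothetical wrapper: run one SQL statement against a DSN in isql batch mode.
run_query() {
    dsn=$1
    sql=$2
    command -v isql >/dev/null 2>&1 || { echo "isql not found" >&2; return 127; }
    # -b: batch mode (no prompts), so the statement can be piped in on stdin
    printf '%s\n' "$sql" | isql -b "$dsn"
}

run_query Impala_DSN "show tables;" || echo "query failed" >&2
```

This is handy for scripts and cron jobs, provided the environment variables from step 7 are exported in that shell as well.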


More details:
http://www.cloudera.com/documentation/other/connectors/impala-odbc/latest/Cloudera-ODBC-Driver-for-Impala-Install-Guide.pdf

