Hive Change Log Release 0.3.1 - Unreleased INCOMPATIBLE CHANGES NEW FEATURES HIVE-61. Implement "ORDER BY". (Namit Jain via zshao) HIVE-376. In strict mode do not allow join without "ON" condition. (Namit Jain via zshao) IMPROVEMENTS HIVE-440. Reducer and Join to print out number of rows processed. (Namit Jain via zshao) OPTIMIZATIONS BUG FIXES HIVE-416. Fix backtrack in Hive.g (Zheng Shao via namit) HIVE-442. Move the data before creating the partition. (Prasad Chakka via zshao) HIVE-447. Fix tests on hudson. (Ashish Thusoo via zshao) HIVE-453. Use FileSystem derived from user supplied path when present instead of default. (Joydeep Sen Sarma via prasadc) HIVE-816. Cache MetastoreClient. (Dave Lerman via zshao) Release 0.3.0 - 2009-04-20 INCOMPATIBLE CHANGES NEW FEATURES HIVE-337. LazySimpleSerDe to support multi-level nested array, map, struct types. (zshao) HIVE-313. Add UDF date_add, date_sub, datediff. (zshao) HIVE-79. Print number of rows inserted to table(s). (Suresh Antony via zshao) HIVE-30. Hive Web User Interface. (Edward Capriolo via athusoo) HIVE-343. Include py packages for thrift service. (Raghotham Murthy via zshao) HIVE-299. Include php packages for thrift service. (Raghotham Murthy via zshao) HIVE-393. Remove unnecessary checks for file type. (Zheng Shao via namit) HIVE-82. Support for building tarballs and javadocs (Ashish Thusoo via rmurthy) HIVE-407. Permission issues in test scripts in tarballs. (Ashish Thusoo via namit) HIVE-61. Implement "ORDER BY". (Namit Jain via zshao) HIVE-376. In strict mode do not allow join without "ON" condition. (Namit Jain via zshao) IMPROVEMENTS HIVE-132. Show table and describe results to be read via FetchTask. (Raghotham Murthy via zshao) HIVE-427. Add missing config parameters in hive-default.xml. (Namit Jain via zshao) HIVE-440. Reducer and Join to print out number of rows processed. (Namit Jain via zshao) HIVE-440. Fixed bug in Reducer and Join to print out number of rows processed. (Zheng Shao via namit) OPTIMIZATIONS BUG FIXES HIVE-349. Fix TestCliDriver when there are test cases with no tasks. (Suresh Antony via zshao) HIVE-339. Fix count distinct in 1 map-reduce job with map side aggregation. (Namit Jain via zshao) HIVE-271. Move test-udfs creation to build-common in order to fix broken 0.17 build. (athusoo) HIVE-286. Use round(xxx,12) to make sure there is no precision matching problem in testCliDriver_udf7. (zshao via johan) HIVE-327. Fix row counts printed in hive history to reflect the count of rows from the query in the session. (Suresh Anthony via athusoo) HIVE-334. Make union3 test output deterministic. (Joydeep Sen Sarma via athusoo) HIVE-320. Fix intermittent nullpointer exception in count(DISTINCT). (Namit Jain via athusoo) HIVE-124. Fix empty aggregations to return 1 row instead of nothing. (namit) HIVE-347. Fix lot of partition scans in strict mode in case of error in partition specification error. (namit) HIVE-251. Throw an error to the CLI if a user defined script fails in TRANSFORM, MAP or REDUCE commands. (athusoo) HIVE-317. Fix HiveServer to accept 1 argument for port. (Min Zhou via athusoo) HIVE-285. Fix to disallow union of different schemas. (Raghotham Murthy via namit) HIVE-253. Fix to disallow computation of rand() at compilation stage. (Raghotham Murthy via namit) HIVE-350. Fix wrong order of aggregations in explain plan. (namit) HIVE-347. undo for Fix lot of partition scans in strict mode in case of error in partition specification error. (namit) HIVE-319. Add UDF for unix timestamp. (Hao Liu via namit) HIVE-363. Use 1 reducer in case of aggregations with no grouping keys. (namit) HIVE-366. Make id a non-bean attribute so that TestParse does not depend on that. (namit) HIVE-342. Fix TestMTQueries (Prasad Chakka via namit) HIVE-318. Fix union all queries. (Namit Jain via zshao) HIVE-367. Null Pointer when some of the files in the table are empty. (Namit Jain via athusoo) HIVE-278. Add support for Hive History to Hive Web UI. (Edward Capriolo via athusoo) HIVE-356. Treat Object[] and List differently. (Zheng Shao via namit) HIVE-373. 1 reducer should be used if no grouping key is present in all scenarios. (Namit Jain via zshao) HIVE-382. fixed problems in map-side aggr - backported from trunk. (namit) HIVE-391. Fix for UDAFCount which was not handling merging nulls (Namit Jain via rmurthy) HIVE-384. Fixing UNION ALL when the first job is a map-only job. (Namit Jain via zshao) HIVE-394. Break plans in Cluster Mode. (Namit Jain via zshao) HIVE-324. Fix AccessControlException when loading data. (Ashish Thusoo via zshao) HIVE-399. Fix timeout problems caused due to the bigdata test. (Namit Jain via athusoo) HIVE-397. Make union tests deterministic by ordering the output results. (Namit Jain via athusoo) HIVE-400. Fix bug in genColList. (Namit Jain via zshao) HIVE-390. Make partition pruner tests deterministic by ordering the output results. (Namit Jain via athusoo) HIVE-403. remove password param from job config that is submitted to job tracker and from log (Prasad Chakka) HIVE-372. Fix parse time Memory consumption for nested UDFs. (Zheng Shao via namit) HIVE-415. Fix long query parsing problems. (Namit Jain via zshao) HIVE-423. Change launch templates to use hive_model.jar. (Raghotham Murthy via zshao) HIVE-421. Fix union followed by multi-table insert. (Namit Jain via zshao). HIVE-404. Fix ordering in "SELECT * FROM t SORT BY col1 LIMIT 100" when query is a outer-most query. (Namit Jain via zshao) HIVE-432. Fix "SORT BY" using only one reducer. (Zheng Shao via namit) HIVE-435. Fix Null pointer exception if password is null. (Prasad Chakka via namit) HIVE-416. Fix backtrack in Hive.g (Zheng Shao via namit) HIVE-442. Move the data before creating the partition. (Prasad Chakka via zshao) HIVE-447. Fix tests on hudson. (Ashish Thusoo via zshao) Release 0.2.0 - Unreleased INCOMPATIBLE CHANGES HIVE-194. Support specifying decimal places for ROUND function. ROUND always rounds away from zero now. (David Phillips via zshao) NEW FEATURES HIVE-223. Single map-reduce group-by using map-side aggregation. (Namit Jain via zshao) HIVE-269. Add log/exp/pow UDF functions to Hive. (zshao) HIVE-258. New UDF IF(test, valueTrue, valueFalseOrNull). (zshao) HIVE-105. Automatically estimate number of required reducers. (zshao) HIVE-176. Added a history log for Hive. (Suresh Anthony via athusoo) HIVE-244. Add SQRT() UDF. (Jeff Hammerbacher via zshao) HIVE-216. Generate ruby bindings for service. (Raghotham Murthy via zshao) HIVE-163. JSON udf function added. (Hao Liu via zshao) HIVE-142. Add a metastore check command. (Johan Oskarsson via zshao) HIVE-91. Add ADD PARTITION support to ALTER TABLE in order to add partitions to a table. (Johan Oskarsson via athusoo) HIVE-148. Extend bin/hive to include the lineage tool. (Suresh Antony via zshao) HIVE-147. Add a tool for extracting lineage info from hive sql. (Suresh Antony via zshao) HIVE-48. Support JDBC connections for interoperability between Hive and RDBMS. (Raghotham Murthy and Michi Mutsuzaki via dhruba) HIVE-140. Event Based Infrastructure for Syntax Trees in the compiler. (Ashish Thusoo through zshao) HIVE-98. Dependency management with hadoop core using ivy. (Ashish Thusoo through zshao) HIVE-73. Thrift Server and Client for Hive (Raghu through zshao) HIVE-113. Distribute by and sort by support. (zshao) HIVE-118. Add date related functions. (zshao) HIVE-62. Support nested types. (Ashish Thusoo via dhruba) IMPROVEMENTS HIVE-300. DISTRIBUTE BY now supports expressions. (zshao) HIVE-292. Replace MetadataTypedColumnsetSerde with LazySimpleSerDe. (zshao) HIVE-294. Support MAP(a.*), REDUCE(a.*) and TRANSFORM(a.*). (zshao) HIVE-270. Add a lazy-deserialized SerDe for efficient deserialization of rows with primitive types. (zshao) HIVE-288. Removed the old serde package. (zshao) HIVE-240. Support CREATE TABLE IF NOT EXISTS. (zshao) HIVE-126. Don't fetch information on Partitions from HDFS instead of MetaStore. (Johan Oskarsson via zshao) HIVE-209. Enabling asserts in tests. (Prasad Chakka via zshao) HIVE-177. Allow specifying custom input and output format in CREATE TABLE. (David Phillips via zshao) HIVE-166. Create eclipse project template. (Ashish Thusoo via zshao) HIVE-186. Refactor code to use a single graph, nodeprocessor, dispatcher and rule abstraction. (Ashish Thusoo via zshao) HIVE-182. Add database name parameter to getTable and dropTable. (Johan Oskarsson via zshao) HIVE-181. Restore UDFTestLength unit test for UDFs. (David Phillips via zshao) HIVE-107. Case structure for bin/hive. (Edward Capriolo via zshao) HIVE-101. Set svn:ignore for the root. (Johan Oskarsson via zshao) HIVE-170. Map-side aggregations to estimate memory size. (Namit via zshao) HIVE-180. Data Generator for thrift-serialized sequence files. (zshao) HIVE-157. Update README.txt to remove refs to mirror.facebook.com. (zshao) HIVE-95. Improve cli error messages by lowering backtracking to 1. (Pete Wyckoff through zshao) HIVE-102. Refactor DDLTask. (Johan Oskarsson through zshao) HIVE-85. New compression options for Hive. (Joydeep Sarma through zshao) HIVE-69. genMapRedTasks uses tree walker. (Namit through zshao) HIVE-249. Give execute permissions to the hive binary (Jeff Hammerbacher via athusoo) HIVE-231. Create junit reports for tests. (Johan Oskarsson via athusoo) HIVE-119. Make error messages more informative. (Pete Wyckoff via athusoo) OPTIMIZATIONS BUG FIXES HIVE-340. Fixed null pointer exception with nulls in map-side aggregation. (Namit Jain via zshao) HIVE-308. UNION ALL: FileSinkOperator now adds files in case the target exists. (zshao) HIVE-276. Fix input3_limit.q for hadoop 0.17. (zshao) HIVE-131. Remove uncommitted files from failed tasks. (Joydeep Sen Sarma via zshao) HIVE-264. TBinarySortable Protocol supports null characters. (zshao) HIVE-255. Propagate user name to Hive metastore. (Prasad Chakka via zshao) HIVE-263. TCTLSeparatedProtocol should use UTF-8 to encode/decode the data. (zshao) HIVE-255. Propagate user name to Hive metastore. (Prasad Chakka via zshao) HIVE-235. Fixed DynamicSerDe to work with null values with Thrift Protocols that can have missing fields for null values. (zshao) HIVE-222. Fixed Group by on a combination of disitinct and non distinct aggregates. (Ashish Thusoo via zshao) HIVE-161. Fixed xpath x.y when x is a null list. (zshao) HIVE-206. Fixed partition pruning check for uppercase table names. (Ashish Thusoo via zshao) HIVE-151. Tasks depending on other tasks cannot be a root task. (Namit Jain via zshao) HIVE-197. Fixed UDFToBoolean. (zshao) HIVE-171. Fixed select star with cluster by, distribute by and sort by. (zshao) HIVE-190. Fixed tests failed due to test running order. (zshao) HIVE-184. Tests fail due to including old hive jar files. (David Phillips via zshao) HIVE-188. Hive CLI quits when stdout is closed. (zshao) HIVE-104. Tables with at least 1 non-string columns to use DynamicSerDe. (zshao) HIVE-158. Make table aliases work for sampled tables in joins. (Raghotham Murthy via zshao) HIVE-168. Fixed join on a subquery with a group by. (Namit Jain via zshao) HIVE-169. Fixed configuration parameter used for determining join interval in JoinOperator. (Namit Jain via zshao) HIVE-114. Drop partition does not delete data for external tables now. (Johan Oskarsson via zshao) HIVE-144. Hive/ql java source copied to build/ql/java to make it work with eclipse. (Johan Oskarsson via zshao) HIVE-129. Fix aux.jar packaging to work properly with 0.17 and 0.18 versions of hadoop. (Joydeep Sarma via zshao) HIVE-162. Fix join0.q test failure with hadoop 0.17. (zshao) HIVE-146. Fix builds for non-default build directory. (David Phillips through zshao) HIVE-47. CLI to support multiline queries from an input file. (Ashish through zshao) HIVE-103. Disable inserts in sub queries. (Namit Jain through zshao) HIVE-92. Fixed union all for non-embedded query. (Namit Jain through zshao) HIVE-93. Dynamic serde to handle _-prefixed column names in DDL. (Pete Wyckoff through zshao) HIVE-127. Fetch Task to pass the jobconf to SerDe. (Namit through zshao) HIVE-116. Let test depend on deploy. (Johan Oskarsson through zshao) HIVE-90. Fixed TestGetDBs for File.list() order. (Johan Oskarsson through zshao) HIVE-111. Support join without a ON clause. (Namit through zshao) HIVE-77. Thread safe query execution. (Joydeep through zshao) HIVE-86. Drop table should not delete data for external tables. (Johan Oskarsson through zshao) HIVE-108. Sampling to use internal column name. (Namit through zshao) HIVE-60. Fix the build scripts for the new hive tree. (Ashish through zshao) HIVE-54. Create this file to record the patches going into a particular release. (dhruba) HIVE-12. Improve README for Hive. (Ashish Thusoo via dhruba) HIVE-68. Fix casting to short. (Ashish Thusoo via dhruba) HIVE-70. A configuration variable that limits the amount of data writen by a task to its standard error. (Joydeep Sen Sarma via dhruba) HIVE-17. Remove duplicate entries for hive.metastore.uris in config file. (Jeff Hammerbacher via dhruba) HIVE-72. Generate correct results when partition pruning is not strict and no map-reduce jobs are needed. (Namim Jain via dhruba) HIVE-75. Make limit work when applied to outermost block that is not a query. (Namim Jain via dhruba) HIVE-76. Fix for column number mismatch between query and destination tables when alias.* expressions are present in the select list of a join. (Ashish Thusoo via dhruba) HIVE-196. Two test runs can run simultaneously on the same machine. (Ashish Thusoo via dhruba) HIVE-202. Fix Lineage for join queries. (Suresh Antony via dhruba) HIVE-84. Make MetaStore Client thread safe. (Prasad Chakka via dhruba) HIVE-203. Fix eclipse templates to get junit tests to run. (Raghotham Murthy via dhruba) HIVE-220. Incorrect log directory in TestMTQueries causing null pointer exception. (Prasad Chakka via dhruba) HIVE-230. Fix for a Null Pointer Exception that occured while loading a table from a query that returns empty data. (Prasad Chakka via dhruba) HIVE-232. Metastore.warehouse configuration should use inherited hadoop configuration. (Prasad Chakka via dhruba) HIVE-239. Check that replace columns in alter table does not have names that are same as the partitioning columns. (Prasad Chakka via athusoo) HIVE-25. Enable Table aliases in cluster by, distribute by and sort by clauses. (Prasad Chakka via athusoo) HIVE-217. Report progress during FileSinkOperator in order to avoid Stream closes exceptions. (Johan Oskarsson via athusoo) HIVE-242. Fix to allow CLI to interpret line breaks in any CLI command. (Prasad Chakka via athusoo) HIVE-65. Rewrite typechecking to use the walker interface and add Resolvers to UDF and UDAF to support proper implicit casting in a manner similar to other RDBMSs. (athusoo) HIVE-246. Fix to make msck command to look for tables in hdfs that are in the metastore. (Johan Oskarsson via athusoo) HIVE-248. Create file path names properly during the load command in case relative path names were specified. (Prasad Chakka via athusoo) HIVE-179. Make substr behavior similar to other RDBMS. (David Phillips via athusoo) HIVE-257. Set hive history log location to build for tests. (Suresh Anthony via athusoo) HIVE-256. Fix in map side aggregation wherein we were not including private members of the records in calculating the avg size of the rows and we were also not estimating variable length columns properly. (Namit Jain via athusoo) HIVE-262. Fix for joins wherein a merged join having and outer or right outer join gave spurious rows in join. (Namit Jain via athusoo) HIVE-189. Make sampling tests deterministic by sorting output. (athusoo) HIVE-273. Fix avg UDAF which got broken hive 65. (athusoo) HIVE-283. Do case insensitive comparison of aliases in partition pruning. (athusoo)