Apache HBase

Overview
Apache HBase [1] is an open-source, distributed, versioned, column-oriented
store modeled after Google' Bigtable: A Distributed Storage System for
Structured Data by Chang et al.[2]  Just as Bigtable leverages the distributed
data storage provided by the Google File System, HBase provides Bigtable-like
capabilities on top of Apache Hadoop [3].

To get started using HBase, the full documentation for this release can be
found under the doc/ directory that accompanies this README.  Using a browser,
open the docs/index.html to view the project home page (or browse to [1]).
The hbase 'book' at http://hbase.apache.org/book.html has a 'quick start'
section and is where you should being your exploration of the hbase project.

The latest HBase can be downloaded from an Apache Mirror [4].

The source code can be found at [5]

The HBase issue tracker is at [6]

Apache HBase is made available under the Apache License, version 2.0 [7]

The HBase mailing lists and archives are listed here [8].

The HBase distribution includes cryptographic software. See the export control
notice here [9].

1. http://hbase.apache.org
2. http://research.google.com/archive/bigtable.html
3. http://hadoop.apache.org
4. http://www.apache.org/dyn/closer.lua/hbase/
5. https://hbase.apache.org/source-repository.html
6. https://hbase.apache.org/issue-tracking.html
7. http://hbase.apache.org/license.html
8. http://hbase.apache.org/mail-lists.html
9. https://hbase.apache.org/export_control.html
Comments
  • HBASE-23887 Up to 3x increase BlockCache performance

    HBASE-23887 Up to 3x increase BlockCache performance

    When data much more than BlockCache we can save CPU cycles and increase performance to 3 times. PS Sorry, had some problems with build previuos PR, trying again.

    opened by pustota2009 134
  • HBASE-26666 Add native TLS encryption support to RPC server/client

    HBASE-26666 Add native TLS encryption support to RPC server/client

    • Originally was: HBASE-26666 Address bearer token being sent over wire before RPC encryption is enabled because it's needed for JWT authentication. The bearer token must be transmitted on an encrypted channel.
    • This code is partially based on ZooKeeper's Client TLS implmentation https://issues.apache.org/jira/browse/ZOOKEEPER-2120
    • https://issues.apache.org/jira/browse/HBASE-24506 is also affected, because I had to pass the entire eventLoopGroup to the Netty pipeline, otherwise SSL handshake has a race condition with Call submission

    Still working on unit tests and documentation.

    opened by anmolnar 133
  • HBASE-26259 Fallback support to pure Java compression

    HBASE-26259 Fallback support to pure Java compression

    This change introduces provided compression codecs to HBase as new Maven modules. Each module provides compression codec support without requiring new external dependencies. The codec implementations are shaded into the module artifacts.

    Previously we relied entirely on Hadoop’s hadoop-common to provide compression support, which in turn relies on native code integration, which may or may not be available on a given hardware platform or in an operational environment. It would be good to have alternative codecs, provided in the HBase distribution, for users who for whatever reason cannot or do not wish to deploy the Hadoop native codecs. Therefore we introduce a few new options which offer a mix of pure Java and native accelerated options. Users can continue to use the Hadoop native codecs or choose what best meets their requirements of these new options.

    

A JMH harness for measuring codec performance and microbenchmark results are attached to the JIRA. The test cases are realistic scenarios based on how we use compression in HFile.

    
There is some future opportunity to optimize the block codec compatibility shims in the various modules. Output might not need to be unconditionally buffered. Buffer allocation time was included in the microbenchmark, could maybe pool and re-use buffers. Data movement into and out of buffers might be done with Unsafe. Follow up work. For the initial version we want an implementation known to operate correctly under all tested conditions.

    • hbase-compression: Parent module


    • hbase-compression-aircompressor: Provides codecs for LZO, LZ4, Snappy, and ZStandard, using Aircompressor. These are all pure Java implementations. All except the ZStandard codec are data compatible with the Hadoop codecs. The ZStandard codec is limited by the algorithm implementation in Aircompressor, which is at an early stage and somewhat specific to Trino’s needs. These codecs, with the exception of ZStandard, offer reasonable performance relative to the Hadoop native codecs, for serving as alternatives. Performance is on par with the Hadoop codecs. Depending on parameters and codec the results might be +- 50% . This module is required to complete this work.

    • hbase-compression-lz4: Provides a codec for LZ4, using lz4-java. The codec is data compatible with and is faster than the Hadoop native codec. Up to 50% faster than Hadoop native. This is the fastest compression codec of all alternatives including the Hadoop native codecs. This module is optional but is very nice to have.

    • hbase-compression-snappy: Provides a codec for Snappy, using the Xerial wrapper. The codec is data compatible with the Hadoop native codec. The integration is different, that is the advantage here. Xerial provides native support bundled into the jar. It does not require deployment and management of separate native link libraries. It is slower than the Hadoop native Snappy codec by up to 75% but faster than most other algorithms regardless of implementation. This module is optional.

    • hbase-compression-xz: At my employer we have a private patch to Hadoop that provides a native codec for LZMA (XZ) and support for same in HBase. LZMA is an extremely aggressive and slow compression algorithm that is only suitable for certain specialized uses cases, but a pure Java option is provided by the algorithm’s developer as “XZ For Java”, so we can offer support for the LZMA algorithm now. This module is optional.


    The modular structure will enable us to offer additional alternatives at some future time, as desired.

    opened by apurtell 85
  • HBASE-11062 hbtop

    HBASE-11062 hbtop

    I removed the dependency for Lanterna to remove the license problem.

    We can run hbtop by running hbase top command, and press h key in the top screen for the help screen.

    For the details of hbtop, this is the presentation in NoSQL day 2019 (the name is changed from htop to hbtop): https://dataworkssummit.com/nosql-day-2019/session/supporting-apache-hbase-troubleshooting-and-supportability-improvements/

    opened by brfrn169 81
  • HBASE-25869 WAL value compression

    HBASE-25869 WAL value compression

    WAL storage can be expensive, especially if the cell values represented in the edits are large, consisting of blobs or significant lengths of text. Such WALs might need to be kept around for a fairly long time to satisfy replication constraints on a space limited (or space-contended) filesystem.

    We have a custom dictionary compression scheme for cell metadata that is engaged when WAL compression is enabled in site configuration. This is fine for that application, where we can expect the universe of values and their lengths in the custom dictionaries to be constrained. For arbitrary cell values it is better to use one of the available compression codecs, which are suitable for arbitrary albeit compressible data.

    Microbrenchmark Results

    Site configuration used:

    <!-- retain all WALs  -->
    <property>
      <name>hbase.master.logcleaner.ttl</name>
      <value>604800000</value>
    </property>
    <!-- enable compression -->
    <property>
     <name>hbase.regionserver.wal.enablecompression</name>
     <value>true</value>
    </property>
    <!-- enable value compression -->
    <property>
     <name>hbase.regionserver.wal.value.enablecompression</name>
     <value>true</value>
    </property>
    <!-- set value compression algorithm —>
    <property>
     <name>hbase.regionserver.wal.value.compression.type</name>
     <value>snappy</value>
    </property>
    

    Loader: IntegrationTestLoadCommonCrawl

    Input: s3n://commoncrawl/crawl-data/CC-MAIN-2021-10/segments/1614178347293.1/warc/CC-MAIN-20210224165708-20210224195708-00000.warc.gz

    SNAPPY or ZSTD at level 1 are recommended, all other options provided for comparison.

    Microbenchmarks are collected with this change. Statistics are collected over the lifetime of the regionserver and are dumped at end of test at shutdown. Statistics are updated under synchronization but this is done in a way that excludes that overhead from measurement. The normal patch does not contain either the instrumentation or the synchronization point. Nanoseconds are converted to milliseconds for the table.

    Mode | WALs aggregate size | WALs aggregate size difference | WAL writer append time (ms avg) -- | -- | -- | -- Default | 5,117,369,553 | - | 0.290 (stdev 0.328) Compression enabled, value compression not enabled | 5,002,683,600 | (2.241%) | 0.372 (stddev 0.336) ~~Compression enabled, value compression enabled, v1 patch, Deflate (best speed)~~ | ~~1,209,947,515~~ | ~~(76.4%)~~ | ~~12.694 (stddev 8.48)~~ Compression enabled, value compression enabled, v2 patch, algorithm=SNAPPY | 1,616,387,702 | (68.4%) | 0.027 (stddev 0.204) Compression enabled, value compression enabled, v2 patch, algorithm=ZSTD (best speed) | 1,149,008,133 | (77.55%) | 0.043 (stddev 0.195) Compression enabled, value compression enabled, v2 patch, algorithm=ZSTD (default) | 1,089,241,811 | (78.7%) | 0.056 (stdev 0.310) Compression enabled, value compression enabled, v2 patch, algorithm=ZSTD (best compression) | 941,452,655 | (81.2%) | 0.231 (stddev 1.11) Options below not recommended. | - | - | - Compression enabled, value compression enabled, v2 patch, algorithm=GZ | 1,082,414,015 | (78.9%) | 0.267 (stddev 1.325) Compression enabled, value compression enabled, v2 patch, algorithm=LZMA (level 1) | 1,013,951,637 | (80.2%) | 2.157 (stddev 3.302) Compression enabled, value compression enabled, v2 patch, algorithm=LZMA (default) | 940,884,618 | (81.7%) | 4.739 (stdev 8.609)

    opened by apurtell 77
  • HBASE-26147: Add dry_run_balancer and related Admin interfaces for running the balancer without executing any region moves

    HBASE-26147: Add dry_run_balancer and related Admin interfaces for running the balancer without executing any region moves

    See https://issues.apache.org/jira/browse/HBASE-26147

    I considered adding another overload of the existing balance() methods, but I felt there already existed a lot of overloads there between the deprecated balancer() methods and the force boolean. Adding a separate dryRunBalance() method seemed like the cleanest and most intuitive way to add this new functionality in a backwards compatible way.

    opened by bbeaudreault 73
  • HBASE-22677 Add unit tests for org.apache.hadoop.hbase.util.ByteRangeUtils and Classes

    HBASE-22677 Add unit tests for org.apache.hadoop.hbase.util.ByteRangeUtils and Classes

    I've analysed your codebase and noticed that org.apache.hadoop.hbase.util.ByteRangeUtils and Classes is not fully tested. I've written some tests for the methods in this class with the help of Diffblue Cover.

    Hopefully, these tests will help you detect any regressions caused by future code changes. If you would find it useful to have additional tests written for this repository, I would be more than happy to look at other classes that you consider important in a subsequent PR.

    opened by EricHetti 69
  • HBASE-26807 Unify CallQueueTooBigException special pause with CallDroppedException

    HBASE-26807 Unify CallQueueTooBigException special pause with CallDroppedException

    Creates a ServerOverloadedException as a base class for both exceptions. Creates a new configuration hbase.client.pause.server.overloaded and deprecates hbase.client.pause.cqtbe. Updates all references to "cqtbe" in code to "serverOverloaded", and changes instanceof checks to check for new base class ServerOverloadedException.

    This patch mostly applies cleanly to branch-2, but I'm going to have to also re-implement it for the blocking client there. I will submit a PR for that once I get some initial agreement on the approach here.

    opened by bbeaudreault 68
  • HBASE-20598 - Upgrade to JRuby 9.2

    HBASE-20598 - Upgrade to JRuby 9.2

    After upgrade, prompt changed (mainly what it prints as session name, used to be "main" now the hbase object instance name). We decided to change prompt to"%N:%03n>", %N being Irb.conf[:IRB_NAME] (hbase) and %03n the line number.

    Add to docs: For changelog please check https://www.jruby.org/news

    Main changes: JRuby 9.2.0 minor release:

    • Ruby 2.5 language and stdlib support
    • 225 issues fixed

    JRuby versions up to 9.2.13.0 had various bugfixes, security fixes, performance improvements (memory reduction, runtime generations), IO improvements, basic support for java11,

    Main gems changes:

    • 9.2.11.0: Gems and gem paths packaged inside jar files will properly load now. This 9.2.10.0 regression was caused by a change in RubyGems 3 that exposed a bug in JRuby
    • 9.2.10.0: RubyGems has been updated to version 3.0.6.

    No change in license

    opened by nkalmar 67
  • HBASE-26347 Support detect and exclude slow DNs in fan-out of WAL

    HBASE-26347 Support detect and exclude slow DNs in fan-out of WAL

    This issue provides the method to detect slow datanodes by checking the packets processing time of each datanode connected by the WAL. When a datanode is considered slow, the datanode will be added to an exclude cache on the regionserver, and every stream created will exclude all the cached slow datanodes in a configured period. The exclude logic cooperate with the log rolling logic, will react more sensitively to the lower slow datanodes, whatever there is hardware failure or hotspots. On our production cluster with 20% machines with hardware failure, the RS serves stably and can keep a high availability. Without this feature on the same cluster, there is frequently aborting of RSes, which caused by wal stuck when flushing cache.

    feature 
    opened by sunhelly 65
  • HBASE-22618 Provide a way to have Heterogeneous deployment

    HBASE-22618 Provide a way to have Heterogeneous deployment

    Hi!

    This is the PR regarding HBASE-22618. The goal of the PR is to allow Heterogeneous HBase clusters.

    We would like to allow users to load custom cost functions that can implement newer ways of balance regions.

    This first commit is providing to possibility load them, and a second commit on this branch will add an working example of a custom cost function. This custom function will allow users to balance regions according to an allowed number of regions per RS.

    Before implementing the cost function, I want to validate first the dynamic load :smiley:

    opened by PierreZ 62
  • HBASE-27227 Long running heavily filtered scans hold up too many ByteBuffAllocator buffers

    HBASE-27227 Long running heavily filtered scans hold up too many ByteBuffAllocator buffers

    I still need to add unit tests, but this works in a deployed cluster. Submitting early in case anyone wants to comment on the approach.

    The basic premise is that currently we only call Shipper.shipped() at the final stage of returning an RPC. This was considered the only place where it was safe to release the blocks, which is true for result-referenced blocks. If we can be sure that a block is not referenced by any returned cells, we should be able to release them early. Shipper will still do the final release of anything retained in prevBlocks.

    RegionScannerImpl and StoreScanner both scan rows at a time and enforce different levels of filters. StoreScanner enforces most of the per-cell filters with a hot loop over cells in the row for that store. At this level we can simply release the block if no cells are matched with qcode INCLUDE*. We do this by calling heap.retainBlock() when including a result, which sets a boolean in HFileReaderImpl. When HFileReaderImpl progresses to the next block, it will only add the old block to prevBlocks if retainBlock() had been called. Otherwise it releases it at that point.

    The RegionScannerImpl level enforces filterRow() filters, since it has the full set of cells for the row. If the row is deemed filtered, the results list is cleared. We handle this level by adding a checkpoint system. At the start of a RegionScannerImpl.next call, checkpoint(START) is called. This sets an index in HFileReaderImpl based on the current size of prevBlocks. Once RegionScannerImpl has decided to filter a row, we call checkpoint(FILTERED) which releases any blocks that had been added to prevBlocks in the creation of cleared result list.

    In order to ensure these don't have adverse affects on unexpected use-cases of HFileReaderImpl, the new behavior is only triggered once checkpoint(State) is called on a scanner. Prior to calling checkpoint, the old behavior will remain in place where all old blocks are added to prevBlocks and only released by calling shipped().

    opened by bbeaudreault 13
  • HBASE-27536: improve slowlog payload

    HBASE-27536: improve slowlog payload

    Related Jira: https://issues.apache.org/jira/browse/HBASE-27536

    I've loaded the server changes onto a test host and verified the following two states:

    • hbase.regionserver.slowlog.operation.json.enabled == false results in an empty string for operationJson
    • hbase.regionserver.slowlog.operation.json.enabled == true results in an operationJson as expected.

    Here's a picture of the entire OnlineLogEntry for a Mutate example: Screen Shot 2022-12-22 at 1 27 59 PM

    And here's a copy of the operationJson for a Scan example:

    {
      "filter": "PageFilter 25",
      "startRow": "P\\x00\\x00\\x00",
      "stopRow": "`\\x00\\x00\\x00",
      "batch": -1,
      "cacheBlocks": false,
      "totalColumns": 0,
      "maxResultSize": "4194304",
      "families": {},
      "caching": 2147483647,
      "maxVersions": 1,
      "timeRange": [
        "0",
        "9223372036854775807"
      ]
    }
    
    opened by rmdmattingly 16
Releases(rel/2.5.2)
  • rel/2.5.2(Dec 5, 2022)

    The HBase team is happy to announce the immediate availability of HBase 2.5.2.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database.

    Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 2.5.2 is the latest patch release in the HBase 2.5.x line. The full list of issues can be found in the included CHANGES and RELEASENOTES, or via our issue tracker:

    https://s.apache.org/hbase-2.5.2-jira
    

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html
    

    Notice that, starting from 2.5.2, we will publish dists and maven artifacts which are built against hadoop3. You can download the hadoop3 dists from the download page which has 'hadoop3-' prefix. And for maven artifacts, please use 2.5.2-hadoop3 as the version of the hbase dependencies in your pom.

    Questions, comments, and problems are always welcome at: [email protected].

    Thanks to all who contributed and made this release possible.

    Cheers, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/2.5.1(Oct 29, 2022)

    The HBase team is happy to announce the immediate availability of HBase 2.5.1.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database.

    Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 2.5.1 is the latest patch release in the HBase 2.5.x line. The full list of issues can be found in the included CHANGES and RELEASENOTES, or via our issue tracker:

    https://s.apache.org/hbase-2.5.1-jira
    

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html
    

    Questions, comments, and problems are always welcome at: [email protected].

    Thanks to all who contributed and made this release possible.

    Cheers, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/2.4.15(Oct 29, 2022)

    The HBase team is happy to announce the immediate availability of HBase 2.4.15.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database.

    Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 2.4.15 is the latest patch release in the HBase 2.4.x line. The full list of issues can be found in the included CHANGES and RELEASENOTES, or via our issue tracker:

    https://s.apache.org/hbase-2.4.15-jira
    

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html
    

    Questions, comments, and problems are always welcome at: [email protected].

    Thanks to all who contributed and made this release possible.

    Cheers, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/2.5.0(Sep 6, 2022)

    The HBase team is happy to announce the immediate availability of HBase 2.5.0.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database. Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 2.5.0 is the sixth minor release in the HBase 2.x line, which aims to improve the stability and reliability of HBase. This release includes nearly 300 resolved issues not covered by previous 2.x releases.

    Notable new features include:

    • Several performance improvements to the on-heap and off-heap data path
    • Several reliability and performance improvements to splitting and merging
    • Several compaction improvements
    • Overhaul of tracing based on OpenTelemetry
    • A region size distribution visualization on the Master UI
    • Expose the details of slow/large requests in Region Server UI
    • Several improvements to the HBase shell, and jshell support with JDK11+
    • Moved storage of meta region location out of ZooKeeper

    The full list of issues can be found in the included CHANGES.md and RELEASENOTES.md, or via our issue tracker:

    https://s.apache.org/hbase-2.5.0-jira
    

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html
    

    Questions, comments, and problems are always welcome at: [email protected].

    Thanks to all who contributed and made this release possible.

    Cheers, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/2.4.14(Sep 2, 2022)

    The HBase team is happy to announce the immediate availability of HBase 2.4.14.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database.

    Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 2.4.14 is the fourteenth patch release in the HBase 2.4.x line, which aims to improve the stability and reliability of HBase. This release includes 49 bug fixes and improvements since 2.4.13.

    The full list of issues and release notes can be found here:

    CHANGELOG: https://downloads.apache.org/hbase/2.4.14/CHANGES.md RELEASENOTES: https://downloads.apache.org/hbase/2.4.14/RELEASENOTES.md

    or via our issue tracker:

    https://issues.apache.org/jira/projects/HBASE/2.4.14-jiras

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html
    

    Questions, comments, and problems are always welcome at:

    [email protected]

    [email protected]

    Thanks to all who contributed and made this release possible.

    Cheers, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/1.7.2(Aug 10, 2022)

    The HBase team is happy to announce the immediate availability of HBase1.7.2.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database.

    Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 1.7.2 is the latest release of the HBase 1.7 line, continuing on the theme of bringing a stable, reliable database to the Apache Big Data ecosystem and beyond.

    1.7.2 includes ~57 bugs and improvement fixes done since 1.7.1. The full list of issues can be found here:

    https://s.apache.org/hbase-1.7.2-jira-release-notes

    And notice that 1.7.2 is the last release of the HBase 1.x line. Thank you for trusting and choosing the 1.x release line. In the future, users are encouraged to upgrade to our current stable release line 2.4.x.

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html
    

    For instructions on verifying ASF release downloads, please see

    https://www.apache.org/dyn/closer.cgi#verify
    

    Project member signature keys can be found at

    https://downloads.apache.org/hbase/KEYS
    

    Thanks to all the contributors who made this release possible!

    Best, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/2.4.13(Jul 4, 2022)

    The HBase team is happy to announce the immediate availability of HBase 2.4.13.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database.

    Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 2.4.13 is the thirteenth patch release in the HBase 2.4.x line. The full list of issues can be found in the included CHANGES and RELEASENOTES, or via our issue tracker:

    https://s.apache.org/hbase-2.4.13-jira
    

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html
    

    Questions, comments, and problems are always welcome at: [email protected].

    Thanks to all who contributed and made this release possible.

    Cheers, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/3.0.0-alpha-3(Jul 1, 2022)

    The HBase team is happy to announce the immediate availability of HBase 3.0.0-alpha-3.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database. Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 3.0.0-alpha-3 is the third alpha release in the HBase 3.x line, which is the third release of our next major release line. It includes roughly 3500+ resolved issues since 2.0.0.

    Notice that this is not a production ready release. It is used to let our users try and test the new major release, to get feedback before the final GA release is out. So please do NOT use it in production. Just try it and report back everything you find unusual.

    Notable new features include: Synchronous Replication OpenTelemetry Tracing Distributed MOB Compaction Backup and Restore Decouple region replication and general replication framework, and also make region replication can work when SKIP_WAL is used

    For other important changes: We do not support hadoop 2.x any more Move RSGroup balancer to core Reimplement sync client on async client CPEPs on shaded proto Move the logging framework from log4j to log4j2

    The full list of issues and release notes can be found here:

    CHANGELOG: https://downloads.apache.org/hbase/3.0.0-alpha-3/CHANGES.md RELEASENOTES: https://downloads.apache.org/hbase/3.0.0-alpha-3/RELEASENOTES.md

    or via our issue tracker:

    https://issues.apache.org/jira/projects/HBASE/versions/12350942 https://issues.apache.org/jira/projects/HBASE/versions/12350250 https://issues.apache.org/jira/projects/HBASE/versions/12332342

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html

    Questions, comments, and problems are always welcome at:

    [email protected] [email protected] [email protected]

    Thanks to all who contributed and made this release possible.

    Cheers, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/2.4.12(Jun 20, 2022)

  • rel/2.4.11(Mar 26, 2022)

    The HBase team is happy to announce the immediate availability of HBase 2.4.11.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database.

    Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 2.4.11 is the eleventh patch release in the HBase 2.4.x line. The full list of issues can be found in the included CHANGES and RELEASENOTES, or via our issue tracker:

    https://s.apache.org/hbase-2.4.11-jira
    

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html
    

    Questions, comments, and problems are always welcome at: [email protected].

    Thanks to all who contributed and made this release possible.

    Cheers, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/2.4.10(Mar 19, 2022)

    The HBase team is happy to announce the immediate availability of HBase 2.4.10.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database.

    Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 2.4.10 is the tenth patch release in the HBase 2.4.x line. The full list of issues can be found in the included CHANGES and RELEASENOTES, or via our issue tracker:

    https://s.apache.org/hbase-2.4.10-jira
    

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html
    

    Questions, comments, and problems are always welcome at: [email protected].

    Thanks to all who contributed and made this release possible.

    Cheers, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/2.4.9(Dec 25, 2021)

    The HBase team is happy to announce the immediate availability of HBase 2.4.9.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database.

    Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 2.4.9 is the ninth patch release in the HBase 2.4.x line. The full list of issues can be found in the included CHANGES and RELEASENOTES, or via our issue tracker:

    https://issues.apache.org/jira/projects/HBASE/versions/12350709
    

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html
    

    Questions, comments, and problems are always welcome at: [email protected].

    Thanks to all who contributed and made this release possible.

    Cheers, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/3.0.0-alpha-2(Dec 23, 2021)

    The HBase team is happy to announce the immediate availability of HBase 3.0.0-alpha-2.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database. Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 3.0.0-alpha-2 is the second alpha release in the HBase 3.x line, which is the second release of our next major release line. It includes roughly 3200+ resolved issues since 2.0.0.

    3.0.0-alpha-2 contains a critical security fix for addressing the log4j2 CVE-2021-44228. All users who already use 3.0.0-alpha-1 should upgrade to 3.0.0-alpha-2 ASAP.

    Notice that this is not a production ready release. It is used to let our users try and test the new major release, to get feedback before the final GA release is out. So please do NOT use it in production. Just try it and report back everything you find unusual.

    Notable new features include: Synchronous Replication OpenTelemetry Tracing Distributed MOB Compaction Backup and Restore

    For other important changes: We do not support hadoop 2.x any more Move RSGroup balancer to core Reimplement sync client on async client CPEPs on shaded proto Move the logging framework from log4j to log4j2

    The full list of issues and release notes can be found here:

    CHANGELOG: https://downloads.apache.org/hbase/3.0.0-alpha-2/CHANGES.md RELEASENOTES: https://downloads.apache.org/hbase/3.0.0-alpha-2/RELEASENOTES.md

    or via our issue tracker:

    https://issues.apache.org/jira/projects/HBASE/versions/12350250 https://issues.apache.org/jira/projects/HBASE/versions/12332342

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html

    Questions, comments, and problems are always welcome at:

    [email protected] [email protected] [email protected]

    Thanks to all who contributed and made this release possible.

    Cheers, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/2.4.8(Nov 4, 2021)

    The HBase team is happy to announce the immediate availability of HBase 2.4.8.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database.

    Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 2.4.8 is the seventh patch release in the HBase 2.4.x line. The full list of issues can be found in the included CHANGES and RELEASENOTES, or via our issue tracker:

    https://s.apache.org/hbase-2.4.8-jira
    

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html
    

    Questions, comments, and problems are always welcome at: [email protected].

    Thanks to all who contributed and made this release possible.

    Cheers, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/1.4.14(Oct 27, 2021)

    The HBase team is happy to announce the immediate availability of HBase1.4.14.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database.

    Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 1.4.14 is the latest release of the HBase 1.4 line, continuing on the theme of bringing a stable, reliable database to the Apache Big Data ecosystem and beyond.

    1.4.14 includes ~45 bugs and improvement fixes done since 1.4.13. The full list of issues can be found here:

    https://s.apache.org/hbase-1.4.14-jira-release-notes
    

    And notice that 1.4.14 is the last release of the HBase 1.4 line. Thank you for trusting and choosing the 1.4.x release line. In the future, users are encouraged to upgrade to our current stable release line 2.4.x.

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html
    

    For instructions on verifying ASF release downloads, please see

    https://www.apache.org/dyn/closer.cgi#verify
    

    Project member signature keys can be found at

    https://www.apache.org/dist/hbase/KEYS
    

    Thanks to all the contributors who made this release possible!

    Best, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/2.4.7(Oct 21, 2021)

    The HBase team is happy to announce the immediate availability of HBase 2.4.7.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database.

    Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 2.4.7 is the seventh patch release in the HBase 2.4.x line. The full list of issues can be found in the included CHANGES and RELEASENOTES, or via our issue tracker:

    https://s.apache.org/hbase-2.4.7-jira
    

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html
    

    Questions, comments, and problems are always welcome at: [email protected].

    Thanks to all who contributed and made this release possible.

    Cheers, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/2.3.7(Oct 26, 2021)

    The HBase team is happy to announce the immediate availability of HBase 2.3.7.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database.

    Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 2.3.7 is the seventh patch release in the HBase 2.3.x line. The full list of issues can be found in the included CHANGES and RELEASENOTES, or via our issue tracker:

    https://s.apache.org/hbase-2.3.7-jira
    

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html
    

    Questions, comments, and problems are always welcome at: [email protected].

    Thanks to all who contributed and made this release possible.

    Cheers, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/2.4.6(Sep 18, 2021)

    The HBase team is happy to announce the immediate availability of HBase 2.4.6.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database.

    Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 2.4.6 is the sixth patch release in the HBase 2.4.x line. The full list of issues can be found in the included CHANGES and RELEASENOTES, or via our issue tracker:

    https://s.apache.org/hbase-2.4.6-jira
    

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html
    

    Questions, comments, and problems are always welcome at: [email protected].

    Thanks to all who contributed and made this release possible.

    Cheers, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/2.4.5(Aug 3, 2021)

    The HBase team is happy to announce the immediate availability of HBase 2.4.5.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database.

    Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 2.4.5 is the fifth patch release in the HBase 2.4.x line. The full list of issues can be found in the included CHANGES and RELEASENOTES, or via our issue tracker:

    https://s.apache.org/hbase-2.4.5-jira

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html

    Questions, comments, and problems are always welcome at: [email protected].

    Thanks to all who contributed and made this release possible.

    Cheers, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/2.3.6(Aug 3, 2021)

    The HBase team is happy to announce the immediate availability of HBase 2.3.6.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database.

    Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 2.3.6 is the sixth patch release in the HBase 2.3.x line. The full list of issues can be found in the included CHANGES and RELEASENOTES, or via our issue tracker:

    https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12310753&version=12350029

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html

    Questions, comments, and problems are always welcome at: [email protected].

    Thanks to all who contributed and made this release possible.

    Cheers, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/3.0.0-alpha-1(Jul 22, 2021)

    The HBase team is happy to announce the immediate availability of HBase 3.0.0-alpha-1.

    Apache HBase™ is an open-source, distributed, versioned, non-relational database. Apache HBase gives you low latency random access to billions of rows with millions of columns atop non-specialized hardware. To learn more about HBase, see https://hbase.apache.org/.

    HBase 3.0.0-alpha-1 is the first alpha release in the HBase 3.x line, which is the first release of our next major release line. It includes roughly 3000+ resolved issues since 2.0.0.

    Notice that this is not a production ready release. It is used to let our users try and test the new major release, to get feedback before the final GA release is out. So please do NOT use it in production. Just try it and report back everything you find unusual.

    Notable new features include: Synchronous Replication OpenTelemetry Tracing Distributed MOB Compaction Backup and Restore

    For other important changes: We do not support hadoop 2.x any more Move RSGroup balancer to core Reimplement sync client on async client CPEPs on shaded proto Move the logging framework from log4j to log4j2

    The full list of issues and release notes can be found here:

    CHANGELOG: https://downloads.apache.org/hbase/3.0.0-alpha-1/CHANGES.md RELEASENOTES: https://downloads.apache.org/hbase/3.0.0-alpha-1/RELEASENOTES.md

    or via our issue tracker:

    https://issues.apache.org/jira/projects/HBASE/versions/12332342

    To download please follow the links and instructions on our website:

    https://hbase.apache.org/downloads.html

    Questions, comments, and problems are always welcome at:

    [email protected] [email protected] [email protected]

    Thanks to all who contributed and made this release possible.

    Cheers, The HBase Dev Team

    Source code(tar.gz)
    Source code(zip)
  • rel/1.7.1(Jul 22, 2021)

  • rel/2.4.4(Jun 20, 2021)

  • rel/2.4.3(Jun 11, 2021)

  • rel/2.4.2(Mar 20, 2021)

  • rel/2.4.1(Feb 25, 2021)

  • rel/2.4.0(Dec 30, 2020)

  • rel/2.3.3(Nov 2, 2020)

  • rel/2.3.2(Sep 25, 2020)

Owner
The Apache Software Foundation
The Apache Software Foundation
Mirror of Apache Cassandra

Apache Cassandra Apache Cassandra is a highly-scalable partitioned row store. Rows are organized into tables with a required primary key. Partitioning

The Apache Software Foundation 7.7k Jan 1, 2023
NoSQL data store using the seastar framework, compatible with Apache Cassandra

Scylla What is Scylla? Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a s

ScyllaDB 8.9k Dec 27, 2022
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.

Elephant Bird About Elephant Bird is Twitter's open source library of LZO, Thrift, and/or Protocol Buffer-related Hadoop InputFormats, OutputFormats,

Twitter 1.1k Jan 5, 2023
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning

Oryx 2 is a realization of the lambda architecture built on Apache Spark and Apache Kafka, but with specialization for real-time large scale machine l

Oryx Project 1.8k Dec 28, 2022
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning

Oryx 2 is a realization of the lambda architecture built on Apache Spark and Apache Kafka, but with specialization for real-time large scale machine l

Oryx Project 1.7k Mar 12, 2021
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning

Oryx 2 is a realization of the lambda architecture built on Apache Spark and Apache Kafka, but with specialization for real-time large scale machine l

Oryx Project 1.8k Dec 28, 2022
Equivalent Exchange 3 Apache 2 Equivalent Exchange 3 pahimar Equivalent-Exchange-3. Mods for Minecraft. License: Apache 2 , .

Welcome to Equivalent Exchange 3! All versions are available here Minecraft Forums page Compiling EE3 - For those that want the latest unreleased feat

Rob Davis 709 Dec 15, 2022
Apache Solr is an enterprise search platform written in Java and using Apache Lucene.

Apache Solr is an enterprise search platform written in Java and using Apache Lucene. Major features include full-text search, index replication and sharding, and result faceting and highlighting.

The Apache Software Foundation 630 Dec 28, 2022
FLiP: StreamNative: Cloud-Native: Streaming Analytics Using Apache Flink SQL on Apache Pulsar

StreamingAnalyticsUsingFlinkSQL FLiP: StreamNative: Cloud-Native: Streaming Analytics Using Apache Flink SQL on Apache Pulsar Running on NVIDIA XAVIER

Timothy Spann 5 Dec 19, 2021
Apache Cayenne is an open source persistence framework licensed under the Apache License

Apache Cayenne is an open source persistence framework licensed under the Apache License, providing object-relational mapping (ORM) and remoting services.

The Apache Software Foundation 284 Dec 31, 2022
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.

Sparkler A web crawler is a bot program that fetches resources from the web for the sake of building applications like search engines, knowledge bases

USC Information Retrieval & Data Science 396 Dec 17, 2022
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads

Apache ORC ORC is a self-describing type-aware columnar file format designed for Hadoop workloads. It is optimized for large streaming reads, but with

The Apache Software Foundation 576 Jan 2, 2023
Apache Calcite

Apache Calcite Apache Calcite is a dynamic data management framework. It contains many of the pieces that comprise a typical database management syste

The Apache Software Foundation 3.6k Dec 31, 2022
Apache Druid: a high performance real-time analytics database.

Website | Documentation | Developer Mailing List | User Mailing List | Slack | Twitter | Download Apache Druid Druid is a high performance real-time a

The Apache Software Foundation 12.3k Jan 1, 2023
Mirror of Apache Deltaspike

Apache DeltaSpike Documentation Mailing Lists Contribution Guide JIRA Apache License v2.0 Apache DeltaSpike is a suite of portable CDI Extensions inte

The Apache Software Foundation 141 Jan 1, 2023
Apache Geode

Contents Overview How to Get Apache Geode Main Concepts and Components Location of Directions for Building from Source Geode in 5 minutes Application

The Apache Software Foundation 2.2k Dec 30, 2022
Apache Wicket - Component-based Java web framework

What is Apache Wicket? Apache Wicket is an open source, java, component based, web application framework. With proper mark-up/logic separation, a POJO

The Apache Software Foundation 657 Dec 31, 2022
Apache Dubbo is a high-performance, java based, open source RPC framework.

Apache Dubbo Project Apache Dubbo is a high-performance, Java-based open-source RPC framework. Please visit official site for quick start and document

The Apache Software Foundation 38.3k Jan 9, 2023