What Is a Data Grid?

06/07/2022 Rodrigo Alonso Aviles

An In-Memory Data Grid (IMDG) is a concept that facilitates caching by creating a high-speed data storage layer that holds information in volatile memory. Thanks to it, we can avoid delays caused by repeating queries or complex calculations whose results have already been obtained.

There are two types of cache:

Local cache: contained within the application itself.
Distributed cache: deployed on one or more nodes that together form a single large logical cache, sharing different data structures with flexible scalability.
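To make the local case concrete, here is a minimal sketch of a cache that lives inside the application process. The `LocalCache` class and the `slow_square` function are hypothetical names invented for this illustration; they are not part of Hazelcast or any other product.

```python
import time


class LocalCache:
    """Minimal in-process cache: values live only in this process's memory."""

    def __init__(self):
        self._store = {}
        self.misses = 0  # how many times we had to run the real computation

    def get_or_compute(self, key, compute):
        # Only call the expensive function when the key has not been seen yet.
        if key not in self._store:
            self.misses += 1
            self._store[key] = compute(key)
        return self._store[key]


def slow_square(n):
    """Hypothetical stand-in for a slow query or a complex calculation."""
    time.sleep(0.01)
    return n * n


cache = LocalCache()
first = cache.get_or_compute(12, slow_square)   # computed and stored
second = cache.get_or_compute(12, slow_square)  # served from memory
```

The second call never reaches `slow_square`. A distributed cache applies the same idea, but the stored entries are spread across several nodes instead of a single process.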

Next, we will look at one of the main Data Grid products on the market: Hazelcast.

Hazelcast

Hazelcast is an open-source distributed storage system whose core purpose is to share different data structures across nodes with flexible, convenient scalability. Data is distributed evenly among the nodes, and the system remains fault-tolerant when nodes go down.
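The idea of balanced distribution with tolerance to node failures can be sketched in a few lines. This is a simplified illustration, not Hazelcast's actual algorithm: the `Cluster` class, the node names, and the round-robin assignment of partitions are all assumptions made for the example. The value 271 mirrors what is commonly cited as Hazelcast's default partition count, but treat it as an assumption too.

```python
import zlib

PARTITION_COUNT = 271  # assumed value for illustration


class Cluster:
    """Toy cluster: each key is stored on a primary node and on one backup node."""

    def __init__(self, node_names):
        self.names = list(node_names)
        self.storage = {name: {} for name in self.names}
        self.alive = set(self.names)

    def replicas(self, key):
        # A stable hash maps the key to a partition; partitions are assigned
        # to nodes round-robin, and the backup lives on the following node.
        pid = zlib.crc32(key.encode("utf-8")) % PARTITION_COUNT
        primary = self.names[pid % len(self.names)]
        backup = self.names[(pid + 1) % len(self.names)]
        return primary, backup

    def put(self, key, value):
        for name in self.replicas(key):
            if name in self.alive:
                self.storage[name][key] = value

    def get(self, key):
        # Read from the primary if it is alive, otherwise from the backup.
        for name in self.replicas(key):
            if name in self.alive and key in self.storage[name]:
                return self.storage[name][key]
        raise KeyError(key)

    def kill(self, name):
        # Simulate a crash: the node disappears together with its memory.
        self.alive.discard(name)
        self.storage[name].clear()


cluster = Cluster(["node-1", "node-2", "node-3"])
for i in range(100):
    cluster.put(f"user-{i}", i)

cluster.kill("node-1")
recovered = [cluster.get(f"user-{i}") for i in range(100)]
```

After `node-1` is lost, every key can still be read from its surviving replica. A real data grid would also promote backups and rebalance partitions across the remaining members, which this sketch leaves out.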

Hazelcast IMDG on the Onesait Platform

Architecture

Deployment and configuration of the nodes so that they work with our data sources.
A client through which we communicate with and configure the nodes.
Microservices: we deploy Jet, which works in a similar way and lets us embed modules in each of the microservices.

Deployments

Docker Cloud

Run Hazelcast instances inside Docker containers, either standalone or as a cluster. To manage them, we can start a client or the Hazelcast Management Center.

Docker & AWS / Azure

This works the same way as the previous option, but the containers are deployed on whichever cloud platform we prefer.

Kubernetes & Docker Swarm

We can manage Hazelcast through Kubernetes or Docker Swarm; this is the recommended option for use with Azure.

Uses

So what is all this useful for? Here are some example uses:

Sharing data and work among several servers.
Distributed data caching.
Secure communication between servers.
In-memory data partitioning.
Parallel processing.
Fail-safe data management.
Running complex queries against cached data.
Load distribution.
Real-time streaming for performance monitoring.
Storing session data in web applications.
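Several of these uses (in-memory partitioning, parallel processing, and queries against cached data) can be combined in one small sketch. Everything here is hypothetical: the `Session` record, the sample data, and the `query_all` helper are made up for illustration and do not reflect Hazelcast's query API.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass(frozen=True)
class Session:
    user: str
    country: str
    cart_total: float


# Hypothetical session data, already split into three in-memory partitions.
partitions = [
    {"s1": Session("ana", "ES", 120.0), "s2": Session("bob", "UK", 15.5)},
    {"s3": Session("carla", "ES", 42.0)},
    {"s4": Session("dan", "US", 300.0), "s5": Session("eva", "ES", 8.0)},
]


def query_partition(partition, predicate):
    """Evaluate the predicate against the entries of a single partition."""
    return [key for key, session in partition.items() if predicate(session)]


def query_all(partitions, predicate):
    """Run the query on every partition in parallel and merge the results."""
    with ThreadPoolExecutor(max_workers=len(partitions)) as pool:
        results = list(pool.map(lambda p: query_partition(p, predicate), partitions))
    return sorted(key for keys in results for key in keys)


big_spanish_carts = query_all(
    partitions, lambda s: s.country == "ES" and s.cart_total > 40
)
```

Each partition is scanned independently, so the work spreads across workers in the same way a data grid spreads it across nodes. Here threads inside one process stand in for separate servers.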

Hazelcast JET

Jet works dynamically and in real time with data streams, and lets us embed each node in applications and microservices. It is recommended when we face large volumes of data and need real-time processing. It provides a low-latency, high-throughput engine.
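As a rough illustration of what processing a stream in real time means, the following sketch counts events per key in fixed (tumbling) time windows and emits each window as soon as it closes. It is plain Python and not the Jet API: the function name, the event format `(timestamp, key)`, and the assumption that events arrive in timestamp order are all choices made for this example.

```python
from collections import defaultdict


def tumbling_window_counts(events, window_seconds):
    """Yield (window_start, {key: count}) each time a window closes.

    Assumes events are (timestamp, key) pairs arriving in timestamp order.
    """
    current_start = None
    counts = defaultdict(int)
    for timestamp, key in events:
        window_start = timestamp - (timestamp % window_seconds)
        if current_start is None:
            current_start = window_start
        if window_start != current_start:
            # The event belongs to a later window: emit the finished one.
            yield current_start, dict(counts)
            current_start, counts = window_start, defaultdict(int)
        counts[key] += 1
    if current_start is not None:
        # Flush the last, still-open window when the stream ends.
        yield current_start, dict(counts)


# Hypothetical sensor readings: (timestamp in seconds, sensor id).
events = [
    (0, "sensor-a"),
    (1, "sensor-b"),
    (4, "sensor-a"),
    (5, "sensor-a"),
    (11, "sensor-b"),
]
windows = list(tumbling_window_counts(iter(events), window_seconds=5))
```

Because the function is a generator, results are produced while the stream is still being read rather than after the whole dataset has been loaded, which is the property that matters for low-latency processing.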

Some features

Low end-to-end latency.
Embedded Hazelcast IMDG.
Simple to use: a single JAR with no dependencies.
Lightweight and embeddable.
Cloud deployment.

Jet uses

Recommended for use in microservices.
Connection to DataBrokers.
Simple integration and deployment.
Support for Big Data and IoT.

Header image: Julius Drost on Unsplash

