Big Data

Big data is a blanket term for the non-traditional strategies and technologies needed to organize, process, and gather insights from large datasets. Many users and organizations are turning to big data for certain types of workloads and using it to supplement their existing analysis and business tools. Tools in this space offer different options for ingesting data into a system, storing it, analyzing it, and working with it through visualizations.

Featured tutorial: Hadoop, Storm, Samza, Spark, and Flink: Big Data Frameworks Compared
Big data is a blanket term for the non-traditional strategies and technologies needed to gather, organize, process, a...

Justin Ellingwood • Published on October 13, 2016 · Updated on October 28, 2016

Featured tutorial: An Introduction to Big Data Concepts and Terminology
Big data is a blanket term for the non-traditional strategies and technologies needed to gather, organize, process, a...

Justin Ellingwood • Published on September 28, 2016

  • 20 Results
    • Question

      Build ETL with pipeline in DigitalOcean

      Hi guys, I need to build an ETL for a company. Does anyone know any library/API/tool to create this ETL quickly and easily? Thank you a lot! Leonardo Pérez
      1 answer · 1 month ago · By leonet10 · Big Data
    • Question

      Upgrading to multi core

      When I upgrade to a multi-core (from 1 vCPU to 2 vCPU) do I need to upgrade anything (besides my code) to take advantage of 2 vCPU? If I simply resized my droplet without a brand new Ubuntu install, if my python singl...
      1 answer · 1 month ago · By CodingMonkey · Applications, Big Data, Data Analysis, PostgreSQL, Python, System Tools, Ubuntu
    • Tutorial

      How To Install Hadoop in Stand-Alone Mode on Ubuntu 20.04

      In this tutorial, you’ll learn how to install Hadoop in stand-alone mode on an Ubuntu 20.04 server. You’ll also run an example MapReduce program to search for occurrences of a regular expression in text files.
      3 months ago · By Tony Tran, Hanif Jetha · Big Data, Clustering, Ubuntu, Ubuntu 20.04
    • Question

      What is the digital ocean plan for this scenario

      So basically I am building an app where users search for services. The backend (Node.js) would be hosted on DigitalOcean together with the database (MongoDB). There would be customer accounts and accounts for the serv...
      1 answer · 5 months ago · By udbasili · Squid, Node.js, Nginx, DigitalOcean App Platform, Big Data, Billing, DigitalOcean
    • Question

      Database for 1m users at once

      I want to know if database hosting is right for me. Normally I have smaller apps, but with this app I will get >20k users with each paying user, and all users in my app will act at the same time. Example: You have...
      1 answer · 6 months ago · By Pixelairport · Databases, Big Data
    • Question

      Difficulty in Accessing MongoDB database hosted on digitalocean through third party ETL tools.

      I was trying to connect my MongoDB database hosted on DigitalOcean through third-party ETL tools. This is to serve as a dat...
      1 answer · 7 months ago · By ayomidefadeyi1 · MongoDB, DigitalOcean Managed MongoDB Database, Big Data
    • Question

      Recommended Droplet configuration for WebScrapping

      Hello everyone! I’m new to DigitalOcean and I’m really in need of learning more. I have a project that makes several requests to several servers constantly, i.e. imagine having a Python script that has around ...
      Accepted Answer: Hi there, I believe that the Droplet that you’ve selected is quite good for a start. I could suggest regularly checking the monitoring graphs via your DigitalOcean Control panel and see how the resources are being uti...
      1 answer · 1 year ago · By quirozvalandres · Debian, Big Data, Building on DigitalOcean, Networking, DigitalOcean Droplets, Ubuntu 20.04
    • Question

      Can i trust Digital ocean ?

      I want to run some important things on DO which require high uptime. Can I trust DO?
      2 answers · 1 year ago · By anirudhahikoka404 · Big Data
    • Question

      How to fix 502 Bad Gateway error message?

      I was trying to upload a data file of around 600 MB to my project, which is hosted on DigitalOcean. It tries to upload, but a 502 Bad Gateway Nginx error is shown, while the upload works completely fine on my local syste...
      2 answers · 1 year ago · By rikeshk012330 · Nginx, Big Data, DigitalOcean, Python, DigitalOcean Droplets
    • Tutorial

      What is Big Data?

      Big data is a blanket term for the non-traditional strategies and technologies needed to organize, process, and gather insights from large datasets. Many users and organizations are turning to big data for certain typ...
      1 year ago · By Brian Boucheron · Glossary, Big Data
    • Question

      AI - Time series analysis with which DBMS?

      Hi guys, does anyone deal with AI? I recently visited an online forum on AI, because I’m very interested in it and I’m thinking about taking this direction professionally - neuroinformatics and artificial intelligence...
      No answers yet · 2 years ago · By diggieFish · Databases, Big Data, Data Analysis
    • Tutorial

      How To Install and Use ClickHouse on Ubuntu 20.04

      ClickHouse is an open source, column-oriented analytics database created by Yandex for OLAP and big data use cases. In this tutorial, you’ll install the ClickHouse database server and client on your machine. You’ll us...
      2 years ago · By bsder · Big Data, Databases, Ubuntu 20.04
    • Question

      How do I manually calculate average session duration?

      Hello guys, can anyone tell me the exact formula for the average session duration and where it comes from? I’m having trouble finding a manual calculation of the average session duration. For example, from the google ana...
      1 answer · 2 years ago · By rifulabyssal · Data Analysis, DigitalOcean Articles, Big Data, GraphQL
    • Question

      Showing as Read only Disk

      I am running my own mail server, and since I restarted my Droplet I am not able to write or delete anything on it. When I log in to the Droplet over SSH using PuTTY and try to make any changes, I can see c...
      1 answer · 2 years ago · By saddamhussainfea · Apache, Big Data, Backups, Arch Linux
    • Question

      Can you send data to a server through cellular?

      Hello all. At my current job we have sensors on one of our building’s roofs that send environmental data from the roof to a physical server in the building. This system is proprietary and our building does not allow ...
      1 answer · 3 years ago · By csmall9 · Applications, Databases, Open Source, Big Data, Conceptual
    • Question

      CPU Optimized Droplet works very slow

      Hello, I was using a CPU Optimized Droplet. In the first month it worked very fast with a low amount of information. For example, a process that I developed took a maximum of 2 minutes to deliver results. But the sec...
      2 answers · 3 years ago · By Andres Ramos · DigitalOcean, Big Data
    • Question

      What is Quantum computing ?

      What is quantum computing, and what is the future of quantum computing?
      1 answer · 3 years ago · By chandu fvl · Big Data, Databases, DigitalOcean, Machine Learning
    • Question

      I would like to max my cpu usage for foreseeable future. Can I?

      I need to do math. Math is hard. I would use 100% CPU for foreseeable future. Is this allowed? I have old account and all is prepaid. I would use a new 3 CPU 15$ droplet.
      2 answers · 3 years ago · By DigitalOceana234400ef21fe9 · DigitalOcean, Big Data
    • Tutorial

      How To Install and Use ClickHouse on Debian 9

      ClickHouse is an open-source, column-oriented analytics database created by Yandex for OLAP and big data use cases. In this tutorial, you’ll install the ClickHouse database server and client on your machine. You’ll us...
      3 years ago · By bsder · Databases, Data Analysis, Big Data, Debian 9
    • Question

      Getting account unblocked

      Hey guys, new here. Been using Digital Ocean off and on for the last few years, mostly for personal projects and have really liked it a lot. Recently, though I started using it for Big Data processes. What I do is spi...
      Accepted Answer: Hey friend, I’d like to explain a bit about the reason for this, and the thoughts behind it. Please know that I’m about to say a lot of things that may not be relevant to you. It isn’t necessarily that crypto is again...
      3 answers · 3 years ago · By John Walz · DigitalOcean, Big Data, Ubuntu 18.04