I want something with a web UI that can graph CPU and RAM usage for today and ideally the previous few days. I would also like to see what was running at any given time (from 2-3 days ago up to now).

Is there any (FOSS) software that does that?

Thanks.

  • model_tar_gz@lemmy.world
    English
    arrow-up
    3
    ·
    2 months ago

    I work for a large enterprise and build ML model monitoring pipelines fairly frequently—this will be a more in depth but similar use case to what you’re asking.

    We use Grafana (visualization) and Prometheus (timeseries db)—they’re built for this use case exactly. Tons of info out there on how to build, configure, connect to your sensors, and deploy it.
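    For the "graph for the last few days" part of the question, here is a rough sketch of how a range query against Prometheus's HTTP API could be built. It assumes node_exporter is being scraped and Prometheus listens on its default port 9090. The helper name `build_range_query_url` and the PromQL expression are illustrative choices, not something from this thread; Grafana issues this kind of query for you once the data source is configured.

```python
import time
from urllib.parse import urlencode

# Assumed PromQL: percentage of CPU time not spent idle, averaged over all cores.
# Requires node_exporter metrics to be present in Prometheus.
CPU_BUSY_PERCENT = '100 - avg(rate(node_cpu_seconds_total{mode="idle"}[5m])) * 100'


def build_range_query_url(base_url, promql, start, end, step):
    """Build a URL for Prometheus's /api/v1/query_range endpoint.

    start and end are Unix timestamps; step is the resolution in seconds.
    """
    params = urlencode({"query": promql, "start": start, "end": end, "step": step})
    return f"{base_url.rstrip('/')}/api/v1/query_range?{params}"


if __name__ == "__main__":
    now = int(time.time())
    three_days_ago = now - 3 * 86400
    # One data point every 5 minutes over the last 3 days.
    print(build_range_query_url("http://localhost:9090", CPU_BUSY_PERCENT,
                                three_days_ago, now, 300))
```

    Fetching that URL should return JSON with one time series of (timestamp, value) pairs, provided the retention period covers the requested days.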

  • Hercules@lemmy.world
    English
    arrow-up
    33
    arrow-down
    2
    ·
    2 months ago

    I think Prometheus + Grafana might be what you are looking for. In combination with Loki, Grafana can also be used for viewing log messages.

    • Im_old@lemmy.world
      English
      arrow-up
      5
      ·
      2 months ago

      Absolutely this, nothing else is required. Well, maybe Alertmanager if you want to receive alerts.

      • farcaller@fstab.sh
        English
        arrow-up
        5
        ·
        2 months ago

        And swap Prometheus for VictoriaMetrics, or your homelab RAM usage becomes Prometheus RAM usage.

    • N1ghtstalk3r@lemmy.world
      English
      arrow-up
      3
      arrow-down
      1
      ·
      2 months ago

      I’ll second this. Prometheus + Grafana is what I’m using now, but you can definitely add more extensions/monitors to get far more detail, like Loki which was suggested above.

      • miau@lemmy.sdf.org
        English
        arrow-up
        4
        ·
        2 months ago

        Grafana is just the frontend; it’s a dashboard for your different data sources. Prometheus is the “database”; it scrapes data from your endpoints over HTTP.
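        To make the scraping part concrete, here is a minimal sketch of reading a metrics endpoint by hand. It is not Prometheus's own code. It assumes a node_exporter-style endpoint at `http://localhost:9100/metrics`, handles only simple `name{labels} value` lines, and ignores optional timestamps. The function names are made up for illustration.

```python
from urllib.request import urlopen


def parse_metrics(text):
    """Parse simple lines of the Prometheus text format into {series: value}.

    Assumption: every sample line is 'series value' with no trailing timestamp.
    Comment lines (# HELP, # TYPE) and blank lines are skipped.
    """
    samples = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # Split on the last space so spaces inside label values survive.
        series, _, value = line.rpartition(" ")
        samples[series] = float(value)
    return samples


def scrape(url="http://localhost:9100/metrics"):
    """Fetch and parse one endpoint (the URL is an assumed default)."""
    with urlopen(url, timeout=5) as response:
        return parse_metrics(response.read().decode("utf-8"))


if __name__ == "__main__":
    example = (
        "# HELP node_load1 1m load average.\n"
        "# TYPE node_load1 gauge\n"
        "node_load1 0.42\n"
        'node_cpu_seconds_total{cpu="0",mode="idle"} 12345.6\n'
    )
    print(parse_metrics(example))
```

        Prometheus does roughly this on a fixed interval for every configured target and stores each value with a timestamp, which is what Grafana later queries.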

    • mumblerfish@lemmy.world
      English
      arrow-up
      1
      ·
      2 months ago

      Which parts are OpenTelemetry for? Is Prometheus Agent, Prometheus Server and Grafana not enough?

      • ohlaph@lemmy.world
        English
        arrow-up
        1
        ·
        2 months ago

        I like it because I use it for MELT in general. Prometheus generally does metrics, and if you want to include logs, traces and events, it becomes more cumbersome. With the OTel collector, I can just update my collector configuration to point to the various services.

        I’m not saying OP can’t use what you suggested, just stating what I would use.

    • whoisearth@lemmy.ca
      English
      arrow-up
      4
      ·
      2 months ago

      If you’re serious about monitoring your shit this is really the best answer. Zabbix is love. Zabbix is life.

  • Rizilia@lemmy.zip
    English
    arrow-up
    2
    ·
    2 months ago

    I recently started using Observium for some basic monitoring. I’m happy so far.

  • azron@lemmy.ml
    English
    arrow-up
    2
    ·
    2 months ago

    Munin is a tried and true solution. It installs on the server, creates graphs, and makes problems like running out of memory easy to spot as a stair-step pattern in the graph.

    I’d also highly recommend installing atop and having it collect stats every 1 to 2 minutes. You can go to a crashed server and step through what was running in a “top”-like interface. I install atop on any server as a means for post-incident diagnosis.
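    The idea behind this kind of "what was running at time X" lookup can be sketched in a few lines. This is only an illustration, not how atop works internally (atop keeps its own log files and replays them). It assumes Linux with `/proc` mounted, and `read_processes` and `SnapshotLog` are hypothetical names.

```python
import bisect
import os


def read_processes(proc_root="/proc"):
    """Return {pid: command name} for processes visible under /proc (Linux only)."""
    processes = {}
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue
        try:
            with open(os.path.join(proc_root, entry, "comm")) as f:
                processes[int(entry)] = f.read().strip()
        except OSError:
            continue  # the process exited before we could read it
    return processes


class SnapshotLog:
    """Keeps timestamped process snapshots and answers 'what ran at time T?'."""

    def __init__(self):
        self._times = []
        self._snapshots = []

    def record(self, timestamp, processes):
        # Assumes snapshots are recorded in increasing timestamp order.
        self._times.append(timestamp)
        self._snapshots.append(processes)

    def at(self, timestamp):
        """Return the latest snapshot taken at or before timestamp, or None."""
        index = bisect.bisect_right(self._times, timestamp)
        return self._snapshots[index - 1] if index else None
```

    A collector would call `log.record(time.time(), read_processes())` every minute or two. After an incident, `log.at(some_timestamp)` returns the process list from the nearest earlier sample.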

  • 𝕸𝖔𝖘𝖘@infosec.pub
    English
    arrow-up
    4
    ·
    2 months ago

    We use LibreNMS. Its docs state that it will do this, but we only use the uptime monitoring feature, so I can’t attest to how well it monitors everything else.

    • Luckyfriend222@lemmy.world
      English
      arrow-up
      2
      ·
      2 months ago

      I use this too. When SNMP is set up there are loads of things you can monitor with LibreNMS. Much less of a learning curve than Grafana + Prometheus, although the latter probably has some nice tweaks available that SNMP does not provide.

  • themoonisacheese@sh.itjust.works
    English
    arrow-up
    3
    arrow-down
    2
    ·
    2 months ago

    Can’t really go wrong with the old-school Nagios + Thruk. The learning curve is a tad steep, but it teaches you a lot about your systems.

    • Evotech@lemmy.world
      English
      arrow-up
      2
      ·
      2 months ago

      Nagios is really not great imo. It’s very not modern.

      But if you insist on Nagios, at least use something like Icinga.

  • Mora@pawb.social
    English
    arrow-up
    6
    ·
    edit-2
    2 months ago

    I’ve recently found Beszel and I want to use it to replace my Grafana/Prometheus/node exporter stack. It seems to be a rather easy and clean solution. Sure, you can do more with Grafana and Prometheus, but I can’t be bothered to learn that when all I want is some simple monitoring.

    https://github.com/henrygd/beszel

  • agile_squirrel@lemmy.ml
    English
    arrow-up
    3
    ·
    2 months ago

    You’ve already received some great suggestions. Another one is Netdata. Personally, I use Glances to collect the data and Home Assistant to display the dashboard, but I only do this because I already had Home Assistant running.