hassio-addons/addon-influxdb

InfluxDB stops every couple of hours, with no visible reason #72

pejotigrek posted onGitHub

Problem/Motivation

InfluxDB stopping and therefore I lose my data, as I'm not always aware of the stop.

Expected behavior

I want my InfluxDB to run without issues.

Actual behavior

InfluxDB is running for some time, and then randomly stops without notice. Log attached.

Steps to reproduce

Run InfluxDB, act normal & "forget about HomeAssistant" as the automations do stuff at home. then, after few hours check the supervisor and see that InfluxDB stopped working some time ago.

Proposed changes

honestly - if I knew what to do, I probably would fix it myself :(

Logs copied from addon's dashboard:

[s6-init] making user provided files available at /var/run/s6/etc...exited 0.
[s6-init] ensuring user provided files have correct perms...exited 0.
[fix-attrs.d] applying ownership & permissions fixes...
[fix-attrs.d] done.
[cont-init.d] executing container initialization scripts...
[cont-init.d] 00-banner.sh: executing... 

-----------------------------------------------------------
 Hass.io Add-on: InfluxDB
 Scalable datastore for metrics, events, and real-time analytics
-----------------------------------------------------------
 Add-on version: 3.6.1
 You are running the latest version of this add-on.
 System: Ubuntu 18.04.4 LTS  (amd64 / qemux86-64)
 Home Assistant Core: 0.106.6
 Home Assistant Supervisor: 209
-----------------------------------------------------------
 Please, share the above information when looking for help
 or support in, e.g., GitHub, forums or the Discord chat.
-----------------------------------------------------------
[cont-init.d] 00-banner.sh: exited 0.
[cont-init.d] 01-log-level.sh: executing... 
[cont-init.d] 01-log-level.sh: exited 0.
[cont-init.d] create-users.sh: executing... 
[cont-init.d] create-users.sh: exited 0.
[cont-init.d] influxdb.sh: executing... 
[13:24:40] INFO: Reporting of usage stats to InfluxData is disabled.
[cont-init.d] influxdb.sh: exited 0.
[cont-init.d] kapacitor.sh: executing... 
[cont-init.d] kapacitor.sh: exited 0.
[cont-init.d] nginx.sh: executing... 
[cont-init.d] nginx.sh: exited 0.
[cont-init.d] done.
[services.d] starting services
[services.d] done.
[13:24:41] INFO: Kapacitor is waiting until InfluxDB is available...
[13:24:41] INFO: Chronograf is waiting until InfluxDB is available...
[13:24:41] INFO: Starting the InfluxDB...
[13:25:20] INFO: Starting Chronograf...
time="2020-03-24T13:25:35+01:00" level=info msg="Reporting usage stats" component=usage freq=24h reporting_addr="https://usage.influxdata.com" stats="os,arch,version,cluster_id,uptime"
time="2020-03-24T13:25:35+01:00" level=info msg="Serving chronograf at http://127.0.0.1:8889" component=server
[13:25:35] INFO: Starting NGinx...
[22:39:24] WARNING: Kapacitor is done waiting. Starting anyway...
[22:39:24] INFO: Starting the Kapacitor

'##:::'##::::'###::::'########:::::'###:::::'######::'####:'########::'#######::'########::
 ##::'##::::'## ##::: ##.... ##:::'## ##:::'##... ##:. ##::... ##..::'##.... ##: ##.... ##:
 ##:'##::::'##:. ##:: ##:::: ##::'##:. ##:: ##:::..::: ##::::: ##:::: ##:::: ##: ##:::: ##:
2020/03/24 22:39:24 Using configuration at: /etc/kapacitor/kapacitor.conf
 #####::::'##:::. ##: ########::'##:::. ##: ##:::::::: ##::::: ##:::: ##:::: ##: ########::
 ##. ##::: #########: ##.....::: #########: ##:::::::: ##::::: ##:::: ##:::: ##: ##.. ##:::
 ##:. ##:: ##.... ##: ##:::::::: ##.... ##: ##::: ##:: ##::::: ##:::: ##:::: ##: ##::. ##::
 ##::. ##: ##:::: ##: ##:::::::: ##:::: ##:. ######::'####:::: ##::::. #######:: ##:::. ##:
..::::..::..:::::..::..:::::::::..:::::..:::......:::....:::::..::::::.......:::..:::::..::

ts=2020-03-24T22:39:24.840+01:00 lvl=error msg="encountered error" service=run err="open server: open service *influxdb.Service: failed to link subscription on startup: authorization failed"
run: open server: open service *influxdb.Service: failed to link subscription on startup: authorization failed
[cont-finish.d] executing container finish scripts...
[cont-finish.d] 99-message.sh: executing... 
[cont-finish.d] 99-message.sh: exited 0.
[cont-finish.d] done.
[s6-finish] waiting for services.
[s6-finish] sending all processes the TERM signal.

:wave: Thanks for opening your first issue here! If you're reporting a :bug: bug, please make sure you include steps to reproduce it. Also, logs, error messages and information about your hardware might be useful.

posted by addons-assistant[bot] about 5 years ago

There isn't much here to work from. The only thing I can come up with is a memory issue. If there is not enough memory left on your device, it will start killing processes.

posted by frenck about 5 years ago

hmm. HP t620 with 4 gigs, about 1.6GB free. glances shows that influxdb container uses about 480MB, but also... 128% of cpu ;) [max, min is about 75%].

I guess there's no other way to see what causes the error? some kind of "master log file" hidden somewhere? ;)

posted by pejotigrek about 5 years ago

ok, lasted few days as I waited for RAM shipping but - I've added another 4 gigs. so right now I've got 8GB of RAM, there's no way it's going low [about 22-25% used]. influx still goes down every couple of hours.

last log before database down:

[s6-init] making user provided files available at /var/run/s6/etc...exited 0.
[s6-init] ensuring user provided files have correct perms...exited 0.
[fix-attrs.d] applying ownership & permissions fixes...
[fix-attrs.d] done.
[cont-init.d] executing container initialization scripts...
[cont-init.d] 00-banner.sh: executing... 

-----------------------------------------------------------
 Hass.io Add-on: InfluxDB
 Scalable datastore for metrics, events, and real-time analytics
-----------------------------------------------------------
 Add-on version: 3.6.1
 There is an update available for this add-on!
 Latest add-on version: null
 Please consider upgrading as soon as possible.
 System: Ubuntu 18.04.4 LTS  (amd64 / qemux86-64)
 Home Assistant Core: 0.107.7
 Home Assistant Supervisor: 214
-----------------------------------------------------------
 Please, share the above information when looking for help
 or support in, e.g., GitHub, forums or the Discord chat.
-----------------------------------------------------------
[cont-init.d] 00-banner.sh: exited 0.
[cont-init.d] 01-log-level.sh: executing... 
[cont-init.d] 01-log-level.sh: exited 0.
[cont-init.d] create-users.sh: executing... 
[cont-init.d] create-users.sh: exited 0.
[cont-init.d] influxdb.sh: executing... 
[15:37:10] INFO: Reporting of usage stats to InfluxData is disabled.
[cont-init.d] influxdb.sh: exited 0.
[cont-init.d] kapacitor.sh: executing... 
[cont-init.d] kapacitor.sh: exited 0.
[cont-init.d] nginx.sh: executing... 
[cont-init.d] nginx.sh: exited 0.
[cont-init.d] done.
[services.d] starting services
[services.d] done.
[15:37:11] INFO: Kapacitor is waiting until InfluxDB is available...
[15:37:11] INFO: Chronograf is waiting until InfluxDB is available...
[15:37:11] INFO: Starting the InfluxDB...
[15:38:02] INFO: Starting Chronograf...
time="2020-03-30T15:38:17+02:00" level=info msg="Serving chronograf at http://127.0.0.1:8889" component=server
time="2020-03-30T15:38:17+02:00" level=info msg="Reporting usage stats" component=usage freq=24h reporting_addr="https://usage.influxdata.com" stats="os,arch,version,cluster_id,uptime"
[15:38:18] INFO: Starting NGinx...
[00:48:38] WARNING: Kapacitor is done waiting. Starting anyway...
[00:48:39] INFO: Starting the Kapacitor

'##:::'##::::'###::::'########:::::'###:::::'######::'####:'########::'#######::'########::
 ##::'##::::'## ##::: ##.... ##:::'## ##:::'##... ##:. ##::... ##..::'##.... ##: ##.... ##:
 ##:'##::::'##:. ##:: ##:::: ##::'##:. ##:: ##:::..::: ##::::: ##:::: ##:::: ##: ##:::: ##:
2020/03/31 00:48:39 Using configuration at: /etc/kapacitor/kapacitor.conf
 #####::::'##:::. ##: ########::'##:::. ##: ##:::::::: ##::::: ##:::: ##:::: ##: ########::
 ##. ##::: #########: ##.....::: #########: ##:::::::: ##::::: ##:::: ##:::: ##: ##.. ##:::
 ##:. ##:: ##.... ##: ##:::::::: ##.... ##: ##::: ##:: ##::::: ##:::: ##:::: ##: ##::. ##::
 ##::. ##: ##:::: ##: ##:::::::: ##:::: ##:. ######::'####:::: ##::::. #######:: ##:::. ##:
..::::..::..:::::..::..:::::::::..:::::..:::......:::....:::::..::::::.......:::..:::::..::

ts=2020-03-31T00:48:39.582+02:00 lvl=error msg="encountered error" service=run err="open server: open service *influxdb.Service: failed to link subscription on startup: authorization failed"
run: open server: open service *influxdb.Service: failed to link subscription on startup: authorization failed
[cont-finish.d] executing container finish scripts...
[cont-finish.d] 99-message.sh: executing... 
[cont-finish.d] 99-message.sh: exited 0.
[cont-finish.d] done.
[s6-finish] waiting for services.
[s6-finish] sending all processes the TERM signal.

log after restarting:

[s6-init] making user provided files available at /var/run/s6/etc...exited 0.
[s6-init] ensuring user provided files have correct perms...exited 0.
[fix-attrs.d] applying ownership & permissions fixes...
[fix-attrs.d] done.
[cont-init.d] executing container initialization scripts...
[cont-init.d] 00-banner.sh: executing... 

-----------------------------------------------------------
 Hass.io Add-on: InfluxDB
 Scalable datastore for metrics, events, and real-time analytics
-----------------------------------------------------------
 Add-on version: 3.6.1
 There is an update available for this add-on!
 Latest add-on version: null
 Please consider upgrading as soon as possible.
 System: Ubuntu 18.04.4 LTS  (amd64 / qemux86-64)
 Home Assistant Core: 0.107.7
 Home Assistant Supervisor: 214
-----------------------------------------------------------
 Please, share the above information when looking for help
 or support in, e.g., GitHub, forums or the Discord chat.
-----------------------------------------------------------
[cont-init.d] 00-banner.sh: exited 0.
[cont-init.d] 01-log-level.sh: executing... 
[cont-init.d] 01-log-level.sh: exited 0.
[cont-init.d] create-users.sh: executing... 
[cont-init.d] create-users.sh: exited 0.
[cont-init.d] influxdb.sh: executing... 
[11:06:22] INFO: Reporting of usage stats to InfluxData is disabled.
[cont-init.d] influxdb.sh: exited 0.
[cont-init.d] kapacitor.sh: executing... 
[cont-init.d] kapacitor.sh: exited 0.
[cont-init.d] nginx.sh: executing... 
[cont-init.d] nginx.sh: exited 0.
[cont-init.d] done.
[services.d] starting services
[services.d] done.
[11:06:23] INFO: Kapacitor is waiting until InfluxDB is available...
[11:06:23] INFO: Chronograf is waiting until InfluxDB is available...
[11:06:23] INFO: Starting the InfluxDB...
[11:07:03] INFO: Starting Chronograf...
time="2020-03-31T11:07:18+02:00" level=info msg="Reporting usage stats" component=usage freq=24h reporting_addr="https://usage.influxdata.com" stats="os,arch,version,cluster_id,uptime"
time="2020-03-31T11:07:18+02:00" level=info msg="Serving chronograf at http://127.0.0.1:8889" component=server
[11:07:18] INFO: Starting NGinx...

is there really nothing I can do? :(

posted by pejotigrek about 5 years ago

problem fixed by uninstalling influxdb-addon & by installing influxdb as standalone, not docker-based.

I'm guessing that some combination of influxdb-docker-image and docker itself isn't playing along [probably not docker or influxdb-docker-image by themselves, as other addons/containers work well].

posted by pejotigrek about 5 years ago

This thread has been automatically locked because it has not had recent activity. Please open a new issue for related bugs and link to relevant comments in this thread.

posted by addons-assistant[bot] almost 5 years ago

Fund this Issue

$0.00
Funded

Pull requests