yuanbiao/emqx - emqx

Commit Graph

Author	SHA1	Message	Date
Erik Timan	8836494542	fix: redact influxdb tokens in a few logs	2023-01-25 14:48:32 +01:00
Andrew Mayorov	903a77b471	test(redis): ensure batch query hit different cluster shards This will inevitably fail: it's not generally possible to update different keys through the same cluster connection, one or more update will fail with `MOVED` status. This testcase should serve as a regression test later.	2023-01-25 15:33:05 +03:00
Zaiming (Stone) Shi	5fdf7fd24c	fix(kafka): use async callback to bump success counters some telemetry events from wolff are discarded: * dropped: this is double counted in wolff, we now only subscribe to the dropped_queue_full event * retried_failed: it has different meanings in wolff, in wolff, it means it's the 2nd (or onward) produce attempt in EMQX, it means it's eventually failed after some retries * retried_success since we are going to handle the success counters in callbac this having this reported from wolff will only make things harder to understand * failed wolff never fails (unelss drop which is a different counter)	2023-01-24 21:12:36 +01:00
Zaiming (Stone) Shi	feb0238911	fix(influxdb): ensure client is stopped before returning error	2023-01-24 20:35:02 +01:00
Erik Timan	7603ab490b	fix(emqx_ee_connector): fix passing of influxdb ssl opts	2023-01-24 14:30:40 +01:00
Erik Timan	28718edbfd	chore: bump application VSNs	2023-01-24 14:12:34 +01:00
Zaiming (Stone) Shi	8fde169abb	Merge pull request #9821 from thalesmg/buffer-worker-expiry-v50 feat(buffer_worker): add expiration time to requests	2023-01-24 13:54:04 +01:00
Thales Macedo Garitezi	ca4a262b75	refactor: re-organize dealing with unrecoverable errors	2023-01-20 12:00:17 -03:00
Thales Macedo Garitezi	6fa6c679bb	feat(buffer_worker): add expiration time to requests With this, we avoid performing work or replying to callers that are no longer waiting on a result. Also introduces two new counters: - `dropped.expired` :: happens when a request expires before being sent downstream - `late_reply` :: when a response is receive from downstream, but the caller is no longer for a reply because the request has expired, and the caller might even have retried it.	2023-01-20 11:36:52 -03:00
Kjell Winblad	8c52264c41	fix: atom leak when doing Kafka bridge dry-run A new atom was created every time one did a dry run of a Kafka bridge (that is, clicking the Test button in the settings dialog for the bridge). After this fix, we will only create a new atom when a bridge with a new name is created. This should be acceptable as bridges with new names are created relatively rarely and it seems to be useful to have an unique atom for every Wolff producer. Fixes: https://emqx.atlassian.net/browse/EMQX-8739	2023-01-20 11:50:20 +01:00
Thales Macedo Garitezi	a5424959c6	refactor: avoid operator precedence bugs	2023-01-18 16:15:34 -03:00
Thales Macedo Garitezi	47f796dd12	refactor: rename `emqx_resource_worker` -> `emqx_resource_buffer_worker` To make it more clear that it's purpose is serve as a buffering layer.	2023-01-18 16:15:34 -03:00
Ilya Averyanov	f9843de7ae	Merge pull request #9628 from savonarola/fix-flaky-redis-bridge-test chore(ee bridge): fix Redis bridge test flakyness	2023-01-18 20:56:13 +02:00
Zaiming (Stone) Shi	caf50d1b35	Merge pull request #9792 from savonarola/redis-bridge-examples chore(bridges): add more helpful descriptions	2023-01-18 18:01:35 +01:00
Ilya Averyanov	f9c3115cd4	chore(bridges): add more helpful descriptions Add more helpful descriptions for `command_template` option of Redis bridge.	2023-01-18 18:17:08 +02:00
Zaiming (Stone) Shi	7abba17b25	Merge pull request #9765 from zmstone/0115-add-password-converter fix(schema): add password converter to ensure its binary() type	2023-01-18 15:09:05 +01:00
Zaiming (Stone) Shi	2d01e604a5	Merge pull request #9799 from zmstone/0118-fix-key_dispatch-kafka-produce-strategy 0118 fix key dispatch kafka produce strategy	2023-01-18 13:52:49 +01:00
Ilya Averyanov	f6fbbf3ee3	chore(bridges): reduce Redis bridge flakyness	2023-01-18 14:34:11 +02:00
Zaiming (Stone) Shi	8f275a66d0	test: add coverage for key_dispatch partition strategy	2023-01-18 11:47:37 +01:00
Zaiming (Stone) Shi	d4f3b4c8c2	Merge remote-tracking branch 'origin/master' into fix-buffer-clear-replayq-on-delete-v50	2023-01-18 11:39:47 +01:00
Zaiming (Stone) Shi	3872c4451f	fix(kafka): olp fix was accidentally deleted in `0fd8880d0a`	2023-01-18 10:52:29 +01:00
Thales Macedo Garitezi	087b667263	fix(buffer_worker): allow signalling unrecoverable errors	2023-01-17 19:50:30 -03:00
Thales Macedo Garitezi	fa01deb3eb	chore: retry as much as possible, don't reply to caller too soon	2023-01-17 16:49:15 -03:00
Thales Macedo Garitezi	006b4bda97	feat(buffer_worker): monitor async workers and cancel their inflight requests upon death	2023-01-17 16:48:48 -03:00
Thales Macedo Garitezi	32a9e60313	feat(buffer_worker): also use the inflight table for sync requests Related: https://emqx.atlassian.net/browse/EMQX-8692 This should also correctly account for `retried.*` metrics for sync requests. Also fixes cases where race conditions for retrying async requests could potentially lead to inconsistent metrics. Fixes more cases where a stale reference to `replayq` was being held accidentally after a `pop`.	2023-01-17 16:48:48 -03:00
Thales Macedo Garitezi	4c25be8a2c	chore(influxdb): remove deprecated value from example	2023-01-17 16:48:48 -03:00
Stefan Strigler	f37b3e4bc4	test: test against `bridges_probe` API	2023-01-17 15:29:19 +01:00
Stefan Strigler	5fc31dcd9e	fix: use atom keys in example to make sure they exist	2023-01-17 15:29:19 +01:00
Zaiming (Stone) Shi	4a7e74f5d6	fix(schema): add password converter to ensure its binary() type	2023-01-17 11:12:17 +01:00
Shawn	1c7fe91a91	fix: typos in emqx_ee_bridge_mongodb.conf	2023-01-17 16:33:54 +08:00
JimMoen	40948f3c0b	fix(influxdb): authentication is not required for influxdb v1 api	2023-01-17 09:29:22 +08:00
Zaiming (Stone) Shi	a7fc5e8fe1	Merge pull request #9761 from zmstone/0114-fix-kafka-value-template-and-docs feat: introduce 'this' concept for placeholder, and use it in Kafka bridge	2023-01-16 13:37:29 +01:00
Zaiming (Stone) Shi	47414e0d53	docs: improve kafka key and value field description	2023-01-16 11:32:09 +01:00
Zaiming (Stone) Shi	376dab7cd8	Merge pull request #9743 from zmstone/0112-docs-fix-some-descriptions 0112 docs fix some descriptions	2023-01-16 09:56:56 +01:00
Zaiming (Stone) Shi	91c5a89985	test: wait for redis connected state the case is sometimes flaky because the health check sometimes return connecting	2023-01-14 18:33:55 +01:00
Stefan Strigler	e08c1d2229	Merge remote-tracking branch 'olcai/refactor-bridges-api' into dev/api-refactor	2023-01-13 15:49:52 +01:00
Erik Timan	7a17fb7308	test(emqx_ee_bridge): fix bridge enable/disable in kafka producer suite	2023-01-13 14:40:54 +01:00
Thales Macedo Garitezi	f25bd288ad	Merge pull request #9742 from thalesmg/expose-resource-opts-mongo-v50 feat(mongo): expose buffer worker opts to the bridge frontend (5.0)	2023-01-13 10:23:49 -03:00
Zhongwen Deng	f15b29b1ef	chore: upgrade app version	2023-01-13 18:22:29 +08:00
Zaiming (Stone) Shi	522e091dd2	docs: chane cache to buffer for Kafka bridge description	2023-01-13 07:46:23 +01:00
Erik Timan	860e21d40f	feat(emqx_bridge): move metrics to own endpoint, rename reset_metrics In order for the /bridges APIs to be consistent with other APIs, we move out metrics from GET /bridges/{id} to its own endpoint, /bridges/{id}/metrics. We also rename /bridges/reset_metrics to /bridges/metrics/reset.	2023-01-12 19:39:41 +01:00
Thales Macedo Garitezi	ed2be4d086	feat(mongo): expose buffer worker opts to the bridge frontend (5.0)	2023-01-12 15:23:41 -03:00
Shawn	1711823487	refactor: remove default value of timestamp field of influxdb	2023-01-12 22:25:44 +08:00
Xinyu Liu	bae811e8b4	Merge pull request #9725 from terry-xiaoyu/remove_the_auto_reconnect_field refactor: remove the auto_reconnect field	2023-01-12 11:11:00 +08:00
Thales Macedo Garitezi	2d5208fc29	Merge pull request #9485 from thalesmg/refactor-doc-gen-50-mkIII feat(docs): add tags to schemas	2023-01-11 16:36:38 -03:00
Thales Macedo Garitezi	9d99d180f9	Merge pull request #9709 from thalesmg/mongodb-bridge-payload-template-v50 feat(mongodb): add `payload_template` field for bridge (e5.0)	2023-01-11 11:14:46 -03:00
Shawn	3e9c4f444f	refactor: remove the auto_reconnect field	2023-01-11 21:47:06 +08:00
Thales Macedo Garitezi	a9d0143d5f	chore(license): change `key` type to `binary()` So that the `raw_default` field in the schema JSON renders properly.	2023-01-11 09:10:04 -03:00
Thales Macedo Garitezi	48e1ba4832	feat(docs): add tags to schemas This'll allow us to split the generated `schema.json` file into subsections for better documentation navigation.	2023-01-11 09:10:03 -03:00
Thales Macedo Garitezi	f3cc722172	docs: improve descriptions Co-authored-by: Zaiming (Stone) Shi <zmstone@gmail.com>	2023-01-10 09:06:12 -03:00
Zaiming (Stone) Shi	67f2159a27	Merge pull request #9653 from zmstone/0101-authz-schema-union-member-selection 0101 authz schema union member selection	2023-01-09 22:17:51 +01:00
Thales Macedo Garitezi	b9f258b737	feat(mongodb): add `payload_template` field for bridge (e5.0) https://emqx.atlassian.net/browse/EMQX-8705 Adds a `payload_template` fields that allows users to customize the payload to publish to MongoDB.	2023-01-09 13:38:11 -03:00
lafirest	20f49b903f	Merge pull request #9702 from lafirest/feat/timescale_matrix feat(bridges): add timescale && matrix bridges	2023-01-10 00:06:17 +08:00
firest	062f14bd65	test(bridges): add timescale && matrix test cases	2023-01-09 23:04:36 +08:00
firest	e08d6dbc9b	feat(bridges): add timescale && matrix bridges	2023-01-09 23:04:29 +08:00
Zaiming (Stone) Shi	c3635f537a	ci: wait for redis in emqx_ee_bridge_redis_SUITE	2023-01-09 14:39:56 +01:00
Zaiming (Stone) Shi	0697c692ed	refactor: mongo_type and redis_type are not mandatory the connector schemas are shared between authn, auth and bridges, the difference is that authn and authz configs are unions like: [ {mongo_type = rs, ... } {another backend config} ] however the brdige types are maps like, for example: mongodb_rs.$name { mongo_type = rs ... } in which case, the mongo_type is not required. in order to keep the schema static as much as possible, this field is chanegd to 'required => false' with a default value. However, for authn and authz, the union selector will still raise exception if the there is no type provided.	2023-01-09 14:26:16 +01:00
Zaiming (Stone) Shi	ffb09f0c4d	test: rename a test option name to avoid clashing with prod config name	2023-01-09 14:26:16 +01:00
Zaiming (Stone) Shi	e52f9d5920	refactor: use union member type selector for authz sources	2023-01-09 14:26:16 +01:00
Zaiming (Stone) Shi	e498010f23	chore: update default license (100 conns)	2023-01-09 09:11:09 +01:00
Zaiming (Stone) Shi	3859878985	Merge pull request #9684 from zmstone/0105-do-not-use-testcase-skip test: do not use tc_user_skip for test cases	2023-01-09 09:02:06 +01:00
firest	ea405fe55d	test(bridges): add test case for the PostgreSQL backend	2023-01-06 11:56:28 +08:00
firest	f7b50c56da	feat(bridges): integrate PostgreSQL into bridges support both simple and batch query	2023-01-06 11:47:22 +08:00
Zaiming (Stone) Shi	5f12cdff6c	ci: should not skip tests in github actions	2023-01-05 22:47:08 +01:00
Zaiming (Stone) Shi	fc12a8c4c8	test: do not use tc_user_skip for test cases	2023-01-05 22:47:06 +01:00
Thales Macedo Garitezi	af31ed4264	docs: improve descriptions Co-authored-by: Zaiming (Stone) Shi <zmstone@gmail.com>	2023-01-05 10:16:00 -03:00
Thales Macedo Garitezi	fd360ac6c0	feat(buffer_worker): refactor buffer/resource workers to always use queue This makes the buffer/resource workers always use `replayq` for queuing, along with collecting multiple requests in a single call. This is done to avoid long message queues for the buffer workers and rely on `replayq`'s capabilities of offloading to disk and detecting overflow. Also, this deprecates the `enable_batch` and `enable_queue` resource creation options, as: i) queuing is now always enables; ii) batch_size > 1 <=> batch_enabled. The corresponding metric `dropped.queue_not_enabled` is dropped, along with `batching`. The batching is too ephemeral, especially considering a default batch time of 20 ms, and is not shown in the dashboard, so it was removed.	2023-01-05 10:15:09 -03:00
Thales Macedo Garitezi	0fd8880d0a	fix(kafka_producer): avoid multiplication of metrics when bridge is recreated	2023-01-05 10:11:59 -03:00
Thales Macedo Garitezi	8e59319bfe	fix(kafka_producer): fix message loss when kafka connection is down	2023-01-05 10:11:59 -03:00
Thales Macedo Garitezi	5bd9f110d6	test: attempt to reduce flakiness	2023-01-05 10:11:59 -03:00
Thales Macedo Garitezi	3304025a9a	Merge pull request #9677 from thalesmg/test-other-gcp-flaky-v50 test: attempt to fix flaky test	2023-01-04 13:30:53 -03:00
Thales Macedo Garitezi	5df485df17	test: attempt to fix flaky test	2023-01-04 10:38:21 -03:00
Thales Macedo Garitezi	4feaf5916f	fix(mongodb): add `type` and `name` fields to the `get`/`post` API docs https://emqx.atlassian.net/browse/EMQX-8643 Adds thoses fields to MongoDB API docs so the frontend may use them to render properly.	2023-01-03 11:46:58 -03:00
Thales Macedo Garitezi	d6c8a106da	test(gcp_pubsub): fix flaky test	2023-01-03 08:52:37 -03:00
lafirest	7985cd3536	Merge pull request #9638 from lafirest/fix/mysql_dup fix(mysql): fix the problem of data loss and bad match when mysql is disconnected	2023-01-02 23:15:44 +08:00
Thales Macedo Garitezi	7e02eac3bc	Merge pull request #9619 from thalesmg/refactor-gauges-v50 refactor(metrics): use absolute gauge values rather than deltas (v5.0)	2023-01-02 10:56:47 -03:00
firest	c77717b1f1	test(mysql): fix test case error	2023-01-02 21:55:22 +08:00
Thales Macedo Garitezi	69ac6b9e0d	fix(kafka): fix handling of `dropped.queue_full` event from wolff https://emqx.atlassian.net/browse/EMQX-8530 `cd20a37e65/src/wolff_producer.erl (L772-L773)` Wolff emits 2 events related to dropped messages when replayq reports overflow. Since in EMQX's side we also bump `dropped` when `dropped_queue_full` happens, that was leading to wrong metrics.	2023-01-02 09:23:38 -03:00
Zaiming (Stone) Shi	fc6d9d0890	chore: issue new Enterprise evaluation license	2023-01-02 11:50:57 +01:00
Zaiming (Stone) Shi	b5d2fc0ede	chore: update BSL license year	2023-01-02 11:23:14 +01:00
Zaiming (Stone) Shi	dbc10c2eed	chore: update copyright year 2023	2023-01-02 09:22:27 +01:00
Zaiming (Stone) Shi	231d6da4dc	Merge pull request #9644 from id/fix-kafka-bridge-connectivity-detection fix(kafka): detect connectivity in on_get_status	2022-12-30 21:54:56 +01:00
Thales Macedo Garitezi	61246c43c4	fix(kakfa_producer): prevent multiple producers from multiplying each other's metrics	2022-12-30 16:51:24 -03:00
Thales Macedo Garitezi	8b060a75f1	refactor(metrics): use absolute gauge values rather than deltas https://emqx.atlassian.net/browse/EMQX-8548 Currently, we face several issues trying to keep resource metrics reasonable. For example, when a resource is re-created and has its metrics reset, but then its durable queue resumes its previous work and leads to strange (often negative) metrics. Instead using `counters` that are shared by more than one worker to manage gauges, we introduce an ETS table whose key is not only scoped by the Resource ID as before, but also by the worker ID. This way, when a worker starts/terminates, they should set their own gauges to their values (often 0 or `replayq:count` when resuming off a queue). With this scoping and initialization procedure, we'll hopefully avoid hitting those strange metrics scenarios and have better control over the gauges.	2022-12-30 16:51:24 -03:00
Zaiming (Stone) Shi	0ce1ca89b7	refactor: use string type for server and servers	2022-12-30 14:20:23 +01:00
Thales Macedo Garitezi	a612eacf3a	Merge pull request #9641 from thalesmg/fix-gcp-pubsub-jwt-refresh-v50 fix(gcp_pubsub): fix potential jwt accumulation and lack of refresh (v5.0)	2022-12-30 09:33:41 -03:00
Ivan Dyachkov	75770f2842	fix(kafka): detect connectivity in on_get_status	2022-12-30 10:18:52 +01:00
Thales Macedo Garitezi	24bae2641e	chore: upgrade wolff -> 1.7.4	2022-12-29 16:58:00 -03:00
Thales Macedo Garitezi	446a4c74d0	fix(gcp_pubsub): fix potential jwt accumulation and lack of refresh (v5.0) https://emqx.atlassian.net/browse/EMQX-8653 Related: - https://emqx.atlassian.net/browse/EEC-737 - https://emqx.atlassian.net/browse/EMQX-8652 Since the rule resource testing mechanism creates a new resource to test the configuration, a new JWT associated with an unique temporary resource was being created and left in the JWT table, leaking it. Also, a wrong case clause when setting the new refresh timer for the JWT worker was preventing it from refreshing from the 2nd refresh onward.	2022-12-29 16:30:36 -03:00
Zaiming (Stone) Shi	728cb85156	Merge pull request #9629 from zmstone/1128-fix-flaky-test-listener-restart-eaddrinuse test: add a 2 seconds sleep between listener sotp and restart	2022-12-29 16:04:19 +01:00
Zaiming (Stone) Shi	5ae4bd6722	chore: bump influxdb client lib version from 1.1.4 to 1.1.5 fixed a compile warning	2022-12-29 10:29:36 +01:00
Zaiming (Stone) Shi	a5e3f6f05b	fix(kafka): Memory OLP is only applicable in linux systems	2022-12-28 15:31:53 +01:00
Thales Macedo Garitezi	ce43e6b3d6	chore: upgrade kafka_protocol, wolff, brod	2022-12-27 10:30:14 -03:00
Thales Macedo Garitezi	4819794401	test(refactor): decrease test teardown noise	2022-12-27 10:30:14 -03:00
Thales Macedo Garitezi	a78ecc4bb6	Merge pull request #9579 from thalesmg/bugfix-kafka-producer-stop-client-ee50 fix(kafka_producer): cleanup client after failing to start producers	2022-12-26 08:59:36 -03:00
Zaiming (Stone) Shi	59970272ff	chore: pin hocon 0.33.0	2022-12-23 12:07:36 +01:00
Erik Timan	13942f5c49	refactor: rename error return in mysql connector	2022-12-22 10:29:12 +01:00
Erik Timan	aab914d65a	test: refactor EE mysql bridge test case Co-authored-by: Thales Macedo Garitezi <thalesmg@gmail.com>	2022-12-22 10:27:38 +01:00
Erik Timan	7df24000a0	test: add more EE mysql bridge test cases	2022-12-22 10:27:38 +01:00
Thales Macedo Garitezi	7242ce426b	fix(kafka_producer): cleanup client after failing to start producers https://emqx.atlassian.net/browse/EMQX-8547 If a Kafka Producer bridge is given bad configuration (e.g.: bad authn credentials), the Wolff client process is started successfully, as it does not attempt to connect, but when the producer process is attempted to be started, it fails (only then the client tries to connect to Kafka). At this point, an error was thrown, but the supervised client process remained running. If the configuration was later fixed and the bridge updated, which prompted its removal and recreation, the Wolff client would report to be "already started", so it would never pick up the new (fixed) configuration, and the producers would perpetually fail to start until the node would be restarted. We simply ensure the client is stopped before throwing the error, unrolling the start-up procedure.	2022-12-19 17:48:40 -03:00

1 2 3 4 5 ...

348 Commits