Configuration Options

Confluent Platform

confluent.license

Confluent will issue a license key to each subscriber. The license key will be a short snippet of text that you can copy and paste. Without the license key, you can use Replicator for a 30-day trial period. If you are a subscriber, please contact Confluent Support for more information.

  • Type: string
  • Valid Values: Confluent Platform license
  • Importance: high

Source Topics

topic.regex

Regex of topics to replicate to the destination cluster.

  • Type: string
  • Default: null
  • Importance: high
topic.whitelist

Whitelist of topics to be replicated.

  • Type: list
  • Default: “”
  • Importance: high
topic.blacklist

Topics to exclude from replication.

  • Type: list
  • Default: “”
  • Importance: high
topic.poll.interval.ms

How often to poll the source cluster for new topics matching topic.whitelist or topic.regex.

  • Type: int
  • Default: 120000
  • Valid Values: [0,...]
  • Importance: low

Source Data Conversion

src.key.converter

Converter for the key field of messages retrieved from the source cluster.

  • Type: class
  • Default: io.confluent.connect.replicator.util.ByteArrayConverter
  • Importance: low
src.value.converter

Converter for the value field of messages retrieved from the source cluster.

  • Type: class
  • Default: io.confluent.connect.replicator.util.ByteArrayConverter
  • Importance: low

Source Zookeeper

src.zookeeper.connect

Zookeeper connection string for the source cluster.

  • Type: string
  • Importance: high
src.zookeeper.connection.timeout.ms

Connection timeout in milliseconds for the source Zookeeper cluster.

  • Type: int
  • Default: 6000
  • Valid Values: [0,...]
  • Importance: low
src.zookeeper.session.timeout.ms

Session timeout in milliseconds for the source Zookeeper cluster.

  • Type: int
  • Default: 6000
  • Valid Values: [0,...]
  • Importance: low

Source Kafka

src.kafka.bootstrap.servers

A list of host/port pairs to use for establishing the initial connection to the Kafka cluster. The client will make use of all servers irrespective of which servers are specified here for bootstrapping — this list only impacts the initial hosts used to discover the full set of servers. This list should be in the form host1:port1,host2:port2,.... Since these servers are just used for the initial connection to discover the full cluster membership (which may change dynamically), this list need not contain the full set of servers (you may want more than one, though, in case a server is down).

  • Type: list
  • Importance: high
src.kafka.client.id

An id string to pass to the server when making requests. The purpose of this is to be able to track the source of requests beyond just ip/port by allowing a logical application name to be included in server-side request logging.

  • Type: string
  • Default: “”
  • Importance: low
src.kafka.request.timeout.ms

The configuration controls the maximum amount of time the client will wait for the response of a request. If the response is not received before the timeout elapses the client will resend the request if necessary or fail the request if retries are exhausted.

  • Type: int
  • Default: 305000
  • Valid Values: [0,...]
  • Importance: medium
src.kafka.retry.backoff.ms

The amount of time to wait before attempting to retry a failed request to a given topic partition. This avoids repeatedly sending requests in a tight loop under some failure scenarios.

  • Type: long
  • Default: 100
  • Valid Values: [0,...]
  • Importance: low
src.kafka.connections.max.idle.ms

Close idle connections after the number of milliseconds specified by this config.

  • Type: long
  • Default: 540000
  • Importance: medium
src.kafka.reconnect.backoff.ms

The amount of time to wait before attempting to reconnect to a given host. This avoids repeatedly connecting to a host in a tight loop. This backoff applies to all requests sent by the consumer to the broker.

  • Type: long
  • Default: 50
  • Valid Values: [0,...]
  • Importance: low
src.kafka.metric.reporters

A list of classes to use as metrics reporters. Implementing the MetricReporter interface allows plugging in classes that will be notified of new metric creation. The JmxReporter is always included to register JMX statistics.

  • Type: list
  • Default: “”
  • Importance: low
src.kafka.metrics.num.samples

The number of samples maintained to compute metrics.

  • Type: int
  • Default: 2
  • Valid Values: [1,...]
  • Importance: low
src.kafka.metrics.sample.window.ms

The window of time a metrics sample is computed over.

  • Type: long
  • Default: 30000
  • Valid Values: [0,...]
  • Importance: low
src.kafka.send.buffer.bytes

The size of the TCP send buffer (SO_SNDBUF) to use when sending data. If the value is -1, the OS default will be used.

  • Type: int
  • Default: 131072
  • Valid Values: [-1,...]
  • Importance: medium
src.kafka.receive.buffer.bytes

The size of the TCP receive buffer (SO_RCVBUF) to use when reading data. If the value is -1, the OS default will be used.

  • Type: int
  • Default: 65536
  • Valid Values: [-1,...]
  • Importance: medium

Source Kafka: Security

If you have enabled SSL or SASL in your Kafka cluster, then you must make sure that Replicator and Kafka Connect are also configured for security. Click on the section to configure encryption or authentication in Replicator and Kafka Connect:

Additional descriptions of the configuration parameters are below:

src.kafka.security.protocol

Protocol used to communicate with brokers. Valid values are: PLAINTEXT, SSL, SASL_PLAINTEXT, SASL_SSL.

  • Type: string
  • Default: PLAINTEXT
  • Valid Values: [PLAINTEXT, SSL, SASL_PLAINTEXT, SASL_SSL]
  • Importance: medium
src.kafka.sasl.mechanism

SASL mechanism used for client connections. This may be any mechanism for which a security provider is available. GSSAPI is the default mechanism.

  • Type: string
  • Default: GSSAPI
  • Importance: medium
src.kafka.sasl.kerberos.ticket.renew.window.factor

Login thread will sleep until the specified window factor of time from last refresh to ticket’s expiry has been reached, at which time it will try to renew the ticket.

  • Type: double
  • Default: 0.8
  • Importance: low
src.kafka.sasl.kerberos.min.time.before.relogin

Login thread sleep time between refresh attempts.

  • Type: long
  • Default: 60000
  • Importance: low
src.kafka.sasl.kerberos.kinit.cmd

Kerberos kinit command path.

  • Type: string
  • Default: /usr/bin/kinit
  • Importance: low
src.kafka.sasl.kerberos.service.name

The Kerberos principal name that Kafka runs as. This can be defined either in Kafka’s JAAS config or in Kafka’s config.

  • Type: string
  • Default: null
  • Importance: medium
src.kafka.sasl.kerberos.ticket.renew.jitter

Percentage of random jitter added to the renewal time.

  • Type: double
  • Default: 0.05
  • Importance: low
src.kafka.ssl.protocol

The SSL protocol used to generate the SSLContext. Default setting is TLS, which is fine for most cases. Allowed values in recent JVMs are TLS, TLSv1.1 and TLSv1.2. SSL, SSLv2 and SSLv3 may be supported in older JVMs, but their usage is discouraged due to known security vulnerabilities.

  • Type: string
  • Default: TLS
  • Importance: medium
src.kafka.ssl.provider

The name of the security provider used for SSL connections. Default value is the default security provider of the JVM.

  • Type: string
  • Default: null
  • Importance: medium
src.kafka.ssl.enabled.protocols

The list of protocols enabled for SSL connections.

  • Type: list
  • Default: TLSv1.2,TLSv1.1,TLSv1
  • Importance: medium
src.kafka.ssl.keystore.location

The location of the key store file. This is optional for client and can be used for two-way authentication for client.

  • Type: string
  • Default: null
  • Importance: high
src.kafka.ssl.cipher.suites

A list of cipher suites. This is a named combination of authentication, encryption, MAC and key exchange algorithm used to negotiate the security settings for a network connection using TLS or SSL network protocol.By default all the available cipher suites are supported.

  • Type: list
  • Default: null
  • Importance: low
src.kafka.ssl.secure.random.implementation

The SecureRandom PRNG implementation to use for SSL cryptography operations.

  • Type: string
  • Default: null
  • Importance: low
src.kafka.ssl.truststore.type

The file format of the trust store file.

  • Type: string
  • Default: JKS
  • Importance: medium
src.kafka.ssl.keystore.type

The file format of the key store file. This is optional for client.

  • Type: string
  • Default: JKS
  • Importance: medium
src.kafka.ssl.trustmanager.algorithm

The algorithm used by trust manager factory for SSL connections. Default value is the trust manager factory algorithm configured for the Java Virtual Machine.

  • Type: string
  • Default: PKIX
  • Importance: low
src.kafka.ssl.truststore.location

The location of the trust store file.

  • Type: string
  • Default: null
  • Importance: high
src.kafka.ssl.keystore.password

The store password for the key store file.This is optional for client and only needed if ssl.keystore.location is configured.

  • Type: password
  • Default: null
  • Importance: high
src.kafka.ssl.keymanager.algorithm

The algorithm used by key manager factory for SSL connections. Default value is the key manager factory algorithm configured for the Java Virtual Machine.

  • Type: string
  • Default: SunX509
  • Importance: low
src.kafka.ssl.key.password

The password of the private key in the key store file. This is optional for client.

  • Type: password
  • Default: null
  • Importance: high
src.kafka.ssl.truststore.password

The password for the trust store file.

  • Type: password
  • Default: null
  • Importance: high
src.kafka.ssl.endpoint.identification.algorithm

The endpoint identification algorithm to validate server hostname using server certificate.

  • Type: string
  • Default: null
  • Importance: low

Source Kafka: Consumer

src.consumer.interceptor.classes

A list of classes to use as interceptors. Implementing the ConsumerInterceptor interface allows you to intercept (and possibly mutate) records received by the consumer. By default, there are no interceptors.

  • Type: list
  • Default: null
  • Importance: low
src.consumer.fetch.max.wait.ms

The maximum amount of time the server will block before answering the fetch request if there isn’t sufficient data to immediately satisfy the requirement given by fetch.min.bytes.

  • Type: int
  • Default: 500
  • Valid Values: [0,...]
  • Importance: low
src.consumer.fetch.min.bytes

The minimum amount of data the server should return for a fetch request. If insufficient data is available the request will wait for that much data to accumulate before answering the request. The default setting of 1 byte means that fetch requests are answered as soon as a single byte of data is available or the fetch request times out waiting for data to arrive. Setting this to something greater than 1 will cause the server to wait for larger amounts of data to accumulate which can improve server throughput a bit at the cost of some additional latency.

  • Type: int
  • Default: 1
  • Valid Values: [0,...]
  • Importance: high
src.consumer.fetch.max.bytes

The maximum amount of data the server should return for a fetch request. This is not an absolute maximum, if the first message in the first non-empty partition of the fetch is larger than this value, the message will still be returned to ensure that the consumer can make progress. The maximum message size accepted by the broker is defined via message.max.bytes (broker config) or max.message.bytes (topic config). Note that the consumer performs multiple fetches in parallel.

  • Type: int
  • Default: 52428800
  • Valid Values: [0,...]
  • Importance: medium
src.consumer.max.partition.fetch.bytes

The maximum amount of data per-partition the server will return. If the first message in the first non-empty partition of the fetch is larger than this limit, the message will still be returned to ensure that the consumer can make progress. The maximum message size accepted by the broker is defined via message.max.bytes (broker config) or max.message.bytes (topic config). See fetch.max.bytes for limiting the consumer request size

  • Type: int
  • Default: 1048576
  • Valid Values: [0,...]
  • Importance: high
src.consumer.max.poll.interval.ms

The maximum delay between invocations of poll() when using consumer group management. This places an upper bound on the amount of time that the consumer can be idle before fetching more records. If poll() is not called before expiration of this timeout, then the consumer is considered failed and the group will rebalance in order to reassign the partitions to another member.

  • Type: int
  • Default: 300000
  • Valid Values: [1,...]
  • Importance: medium
src.consumer.max.poll.records

The maximum number of records returned in a single call to poll().

  • Type: int
  • Default: 500
  • Valid Values: [1,...]
  • Importance: medium
src.consumer.check.crcs

Automatically check the CRC32 of the records consumed. This ensures no on-the-wire or on-disk corruption to the messages occurred. This check adds some overhead, so it may be disabled in cases seeking extreme performance.

  • Type: boolean
  • Default: true
  • Importance: low

Destination Topics

topic.rename.format

A format string for the topic name in the destination cluster, which may contain ‘${topic}’ as a placeholder for the originating topic name. For example, dc_${topic} for the topic ‘orders’ will map to the destination topic name ‘dc_orders’.

Be careful of the potential for topic name collisions when configuring replicators from multiple source clusters. We typically recommend that each cluster be given a distinct prefix or suffix (as in the example above).

  • Type: string
  • Default: ${topic}
  • Importance: high
topic.auto.create

Whether to automatically create topics in the destination cluster if required.

  • Type: boolean
  • Default: true
  • Importance: low
topic.preserve.partitions

Whether to automatically increase the number of partitions in the destination cluster to match the source cluster and ensure that messages replicated from the source cluster use the same partition in the destination cluster.

  • Type: boolean
  • Default: true
  • Importance: low
topic.create.backoff.ms

Time to wait before retrying auto topic creation or expansion.

  • Type: int
  • Default: 120000
  • Valid Values: [0,...]
  • Importance: low
topic.config.sync

Whether to periodically sync topic configuration to the destination cluster.

  • Type: boolean
  • Default: true
  • Importance: low
topic.config.sync.interval.ms

How often to check for configuration changes when topic.config.sync is enabled.

  • Type: int
  • Default: 120000
  • Valid Values: [0,...]
  • Importance: low
topic.timestamp.type

The timestamp type for the topics in the destination cluster.

  • Type: string
  • Default: CreateTime
  • Valid Values: [CreateTime, LogAppendTime]
  • Importance: low

Destination Zookeeper

dest.zookeeper.connect

Zookeeper connection string for the destination cluster.

  • Type: string
  • Importance: high
dest.zookeeper.connection.timeout.ms

Connection timeout in milliseconds for the destination Zookeeper cluster.

  • Type: int
  • Default: 6000
  • Valid Values: [0,...]
  • Importance: low
dest.zookeeper.session.timeout.ms

Session timeout in milliseconds for the destination Zookeeper cluster.

  • Type: int
  • Default: 6000
  • Valid Values: [0,...]
  • Importance: low

Destination Data Conversion

key.converter

Converter for the key field of messages written to the destination cluster. Use io.confluent.connect.replicator.util.ByteArrayConverter if you don’t need data conversion.

  • Type: class
  • Default: inherited from worker configuration
  • Importance: low
value.converter

Converter for the value field of messages written to the destination cluster. Use io.confluent.connect.replicator.util.ByteArrayConverter if you don’t need data conversion.

  • Type: class
  • Default: inherited from worker configuration
  • Importance: low