Configuration Options¶
Confluent Platform¶
confluent.license
Confluent will issue a license key to each subscriber. The license key will be a short snippet of text that you can copy and paste. Without the license key, you can use Replicator for a 30-day trial period. If you are a subscriber, please contact Confluent Support for more information.
- Type: string
- Valid Values: Confluent Platform license
- Importance: high
Source Topics¶
topic.regex
Regex of topics to replicate to the destination cluster.
- Type: string
- Default: null
- Importance: high
topic.whitelist
Whitelist of topics to be replicated.
- Type: list
- Default: “”
- Importance: high
topic.blacklist
Topics to exclude from replication.
- Type: list
- Default: “”
- Importance: high
topic.poll.interval.ms
How often to poll the source cluster for new topics matching topic.whitelist or topic.regex.
- Type: int
- Default: 120000
- Valid Values: [0,...]
- Importance: low
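As an illustration only, the source topic options above might appear in a connector properties file as sketched below. The topic names are hypothetical, and the sketch assumes that a topic named in topic.blacklist is excluded even when it matches topic.regex.

```properties
# Hypothetical topic names, for illustration only.
# Replicate every topic whose name starts with "orders-".
topic.regex=orders-.*
# Assumption: this topic is skipped even though it matches the regex above.
topic.blacklist=orders-internal
# Check the source cluster for new matching topics every 60 seconds
# instead of the default 120000 ms.
topic.poll.interval.ms=60000
```

An explicit list such as topic.whitelist=orders,payments could be used in place of topic.regex when the set of topics is fixed.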
Source Data Conversion¶
src.key.converter
Converter for the key field of messages retrieved from the source cluster.
- Type: class
- Default: io.confluent.connect.replicator.util.ByteArrayConverter
- Importance: low
src.value.converter
Converter for the value field of messages retrieved from the source cluster.
- Type: class
- Default: io.confluent.connect.replicator.util.ByteArrayConverter
- Importance: low
Source Zookeeper¶
src.zookeeper.connect
Zookeeper connection string for the source cluster.
- Type: string
- Importance: high
src.zookeeper.connection.timeout.ms
Connection timeout in milliseconds for the source Zookeeper cluster.
- Type: int
- Default: 6000
- Valid Values: [0,...]
- Importance: low
src.zookeeper.session.timeout.ms
Session timeout in milliseconds for the source Zookeeper cluster.
- Type: int
- Default: 6000
- Valid Values: [0,...]
- Importance: low
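A minimal sketch of the source Zookeeper settings follows. The host names are hypothetical, 2181 is the conventional Zookeeper client port, and the timeout values are illustrative rather than recommended.

```properties
# Hypothetical hosts; comma-separated host:port pairs.
src.zookeeper.connect=zk-src-1:2181,zk-src-2:2181,zk-src-3:2181
# Raise both timeouts from the 6000 ms default, for example on a slow link.
src.zookeeper.connection.timeout.ms=10000
src.zookeeper.session.timeout.ms=10000
```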
Source Kafka¶
src.kafka.bootstrap.servers
A list of host/port pairs to use for establishing the initial connection to the Kafka cluster. The client will make use of all servers irrespective of which servers are specified here for bootstrapping; this list only impacts the initial hosts used to discover the full set of servers. This list should be in the form ‘host1:port1,host2:port2,...’. Since these servers are just used for the initial connection to discover the full cluster membership (which may change dynamically), this list need not contain the full set of servers (you may want more than one, though, in case a server is down).
- Type: list
- Importance: high
src.kafka.client.id
An id string to pass to the server when making requests. The purpose of this is to be able to track the source of requests beyond just ip/port by allowing a logical application name to be included in server-side request logging.
- Type: string
- Default: “”
- Importance: low
src.kafka.request.timeout.ms
This configuration controls the maximum amount of time the client will wait for the response to a request. If the response is not received before the timeout elapses, the client will resend the request if necessary, or fail the request if retries are exhausted.
- Type: int
- Default: 305000
- Valid Values: [0,...]
- Importance: medium
src.kafka.retry.backoff.ms
The amount of time to wait before attempting to retry a failed request to a given topic partition. This avoids repeatedly sending requests in a tight loop under some failure scenarios.
- Type: long
- Default: 100
- Valid Values: [0,...]
- Importance: low
src.kafka.connections.max.idle.ms
Close idle connections after the number of milliseconds specified by this config.
- Type: long
- Default: 540000
- Importance: medium
src.kafka.reconnect.backoff.ms
The amount of time to wait before attempting to reconnect to a given host. This avoids repeatedly connecting to a host in a tight loop. This backoff applies to all requests sent by the consumer to the broker.
- Type: long
- Default: 50
- Valid Values: [0,...]
- Importance: low
src.kafka.metric.reporters
A list of classes to use as metrics reporters. Implementing the MetricsReporter interface allows plugging in classes that will be notified of new metric creation. The JmxReporter is always included to register JMX statistics.
- Type: list
- Default: “”
- Importance: low
src.kafka.metrics.num.samples
The number of samples maintained to compute metrics.
- Type: int
- Default: 2
- Valid Values: [1,...]
- Importance: low
src.kafka.metrics.sample.window.ms
The window of time a metrics sample is computed over.
- Type: long
- Default: 30000
- Valid Values: [0,...]
- Importance: low
src.kafka.send.buffer.bytes
The size of the TCP send buffer (SO_SNDBUF) to use when sending data. If the value is -1, the OS default will be used.
- Type: int
- Default: 131072
- Valid Values: [-1,...]
- Importance: medium
src.kafka.receive.buffer.bytes
The size of the TCP receive buffer (SO_RCVBUF) to use when reading data. If the value is -1, the OS default will be used.
- Type: int
- Default: 65536
- Valid Values: [-1,...]
- Importance: medium
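The following sketch shows how the source Kafka connection options above might be set. The broker host names and the client id are hypothetical; only the property names come from this reference.

```properties
# Hypothetical brokers. Two entries are listed so that the initial
# connection still succeeds if one of them is down.
src.kafka.bootstrap.servers=kafka-src-1:9092,kafka-src-2:9092
# Logical name that shows up in server-side request logging.
src.kafka.client.id=replicator-dc1
# -1 asks for the OS default TCP buffer sizes.
src.kafka.send.buffer.bytes=-1
src.kafka.receive.buffer.bytes=-1
```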
Source Kafka: Security¶
If you have enabled SSL or SASL in your Kafka cluster, then you must make sure that Replicator and Kafka Connect are also configured for security. See the following sections to configure encryption or authentication in Replicator and Kafka Connect:
- Encryption with SSL
- Authentication with SSL
- Authentication with SASL/GSSAPI
- Authentication with SASL/SCRAM
- Authentication with SASL/PLAIN
Additional descriptions of the configuration parameters are below:
src.kafka.security.protocol
Protocol used to communicate with brokers. Valid values are: PLAINTEXT, SSL, SASL_PLAINTEXT, SASL_SSL.
- Type: string
- Default: PLAINTEXT
- Valid Values: [PLAINTEXT, SSL, SASL_PLAINTEXT, SASL_SSL]
- Importance: medium
src.kafka.sasl.mechanism
SASL mechanism used for client connections. This may be any mechanism for which a security provider is available. GSSAPI is the default mechanism.
- Type: string
- Default: GSSAPI
- Importance: medium
src.kafka.sasl.kerberos.ticket.renew.window.factor
Login thread will sleep until the specified window factor of time from last refresh to ticket’s expiry has been reached, at which time it will try to renew the ticket.
- Type: double
- Default: 0.8
- Importance: low
src.kafka.sasl.kerberos.min.time.before.relogin
Login thread sleep time between refresh attempts.
- Type: long
- Default: 60000
- Importance: low
src.kafka.sasl.kerberos.kinit.cmd
Kerberos kinit command path.
- Type: string
- Default: /usr/bin/kinit
- Importance: low
src.kafka.sasl.kerberos.service.name
The Kerberos principal name that Kafka runs as. This can be defined either in Kafka’s JAAS config or in Kafka’s config.
- Type: string
- Default: null
- Importance: medium
src.kafka.sasl.kerberos.ticket.renew.jitter
Percentage of random jitter added to the renewal time.
- Type: double
- Default: 0.05
- Importance: low
src.kafka.ssl.protocol
The SSL protocol used to generate the SSLContext. Default setting is TLS, which is fine for most cases. Allowed values in recent JVMs are TLS, TLSv1.1 and TLSv1.2. SSL, SSLv2 and SSLv3 may be supported in older JVMs, but their usage is discouraged due to known security vulnerabilities.
- Type: string
- Default: TLS
- Importance: medium
src.kafka.ssl.provider
The name of the security provider used for SSL connections. Default value is the default security provider of the JVM.
- Type: string
- Default: null
- Importance: medium
src.kafka.ssl.enabled.protocols
The list of protocols enabled for SSL connections.
- Type: list
- Default: TLSv1.2,TLSv1.1,TLSv1
- Importance: medium
src.kafka.ssl.keystore.location
The location of the key store file. This is optional for the client and can be used for two-way authentication of the client.
- Type: string
- Default: null
- Importance: high
src.kafka.ssl.cipher.suites
A list of cipher suites. This is a named combination of authentication, encryption, MAC and key exchange algorithm used to negotiate the security settings for a network connection using TLS or SSL network protocol. By default all the available cipher suites are supported.
- Type: list
- Default: null
- Importance: low
src.kafka.ssl.secure.random.implementation
The SecureRandom PRNG implementation to use for SSL cryptography operations.
- Type: string
- Default: null
- Importance: low
src.kafka.ssl.truststore.type
The file format of the trust store file.
- Type: string
- Default: JKS
- Importance: medium
src.kafka.ssl.keystore.type
The file format of the key store file. This is optional for client.
- Type: string
- Default: JKS
- Importance: medium
src.kafka.ssl.trustmanager.algorithm
The algorithm used by trust manager factory for SSL connections. Default value is the trust manager factory algorithm configured for the Java Virtual Machine.
- Type: string
- Default: PKIX
- Importance: low
src.kafka.ssl.truststore.location
The location of the trust store file.
- Type: string
- Default: null
- Importance: high
src.kafka.ssl.keystore.password
The store password for the key store file. This is optional for client and only needed if ssl.keystore.location is configured.
- Type: password
- Default: null
- Importance: high
src.kafka.ssl.keymanager.algorithm
The algorithm used by key manager factory for SSL connections. Default value is the key manager factory algorithm configured for the Java Virtual Machine.
- Type: string
- Default: SunX509
- Importance: low
src.kafka.ssl.key.password
The password of the private key in the key store file. This is optional for client.
- Type: password
- Default: null
- Importance: high
src.kafka.ssl.truststore.password
The password for the trust store file.
- Type: password
- Default: null
- Importance: high
src.kafka.ssl.endpoint.identification.algorithm
The endpoint identification algorithm to validate server hostname using server certificate.
- Type: string
- Default: null
- Importance: low
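As a sketch of how the SSL parameters above fit together, the fragment below configures an encrypted connection to the source cluster. The file paths and the bracketed password placeholders are hypothetical. SASL setups need additional JAAS configuration that is not covered by this fragment.

```properties
# Encrypt traffic to the source brokers.
src.kafka.security.protocol=SSL
# Trust store used to verify the broker certificates (hypothetical path).
src.kafka.ssl.truststore.location=/path/to/client.truststore.jks
src.kafka.ssl.truststore.password=<truststore-password>
# The key store settings are optional and only needed when the brokers
# require two-way (client) authentication.
src.kafka.ssl.keystore.location=/path/to/client.keystore.jks
src.kafka.ssl.keystore.password=<keystore-password>
src.kafka.ssl.key.password=<key-password>
```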
Source Kafka: Consumer¶
src.consumer.interceptor.classes
A list of classes to use as interceptors. Implementing the ConsumerInterceptor interface allows you to intercept (and possibly mutate) records received by the consumer. By default, there are no interceptors.
- Type: list
- Default: null
- Importance: low
src.consumer.fetch.max.wait.ms
The maximum amount of time the server will block before answering the fetch request if there isn’t sufficient data to immediately satisfy the requirement given by fetch.min.bytes.
- Type: int
- Default: 500
- Valid Values: [0,...]
- Importance: low
src.consumer.fetch.min.bytes
The minimum amount of data the server should return for a fetch request. If insufficient data is available the request will wait for that much data to accumulate before answering the request. The default setting of 1 byte means that fetch requests are answered as soon as a single byte of data is available or the fetch request times out waiting for data to arrive. Setting this to something greater than 1 will cause the server to wait for larger amounts of data to accumulate which can improve server throughput a bit at the cost of some additional latency.
- Type: int
- Default: 1
- Valid Values: [0,...]
- Importance: high
src.consumer.fetch.max.bytes
The maximum amount of data the server should return for a fetch request. This is not an absolute maximum: if the first message in the first non-empty partition of the fetch is larger than this value, the message will still be returned to ensure that the consumer can make progress. The maximum message size accepted by the broker is defined via message.max.bytes (broker config) or max.message.bytes (topic config). Note that the consumer performs multiple fetches in parallel.
- Type: int
- Default: 52428800
- Valid Values: [0,...]
- Importance: medium
src.consumer.max.partition.fetch.bytes
The maximum amount of data per-partition the server will return. If the first message in the first non-empty partition of the fetch is larger than this limit, the message will still be returned to ensure that the consumer can make progress. The maximum message size accepted by the broker is defined via message.max.bytes (broker config) or max.message.bytes (topic config). See fetch.max.bytes for limiting the consumer request size.
- Type: int
- Default: 1048576
- Valid Values: [0,...]
- Importance: high
src.consumer.max.poll.interval.ms
The maximum delay between invocations of poll() when using consumer group management. This places an upper bound on the amount of time that the consumer can be idle before fetching more records. If poll() is not called before expiration of this timeout, then the consumer is considered failed and the group will rebalance in order to reassign the partitions to another member.
- Type: int
- Default: 300000
- Valid Values: [1,...]
- Importance: medium
src.consumer.max.poll.records
The maximum number of records returned in a single call to poll().
- Type: int
- Default: 500
- Valid Values: [1,...]
- Importance: medium
src.consumer.check.crcs
Automatically check the CRC32 of the records consumed. This ensures no on-the-wire or on-disk corruption to the messages occurred. This check adds some overhead, so it may be disabled in cases seeking extreme performance.
- Type: boolean
- Default: true
- Importance: low
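The fragment below is one possible way to trade a little latency for throughput with the consumer options above, following the description of fetch.min.bytes. The values are illustrative assumptions, not tuning recommendations.

```properties
# Wait until at least 64 KB is available before answering a fetch...
src.consumer.fetch.min.bytes=65536
# ...but never block a fetch for longer than 500 ms (the default).
src.consumer.fetch.max.wait.ms=500
# Allow up to 1000 records per call to poll() instead of the default 500.
src.consumer.max.poll.records=1000
```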
Destination Topics¶
topic.rename.format
A format string for the topic name in the destination cluster, which may contain ‘${topic}’ as a placeholder for the originating topic name. For example, ‘dc_${topic}’ for the topic ‘orders’ will map to the destination topic name ‘dc_orders’. Be careful of the potential for topic name collisions when configuring replicators from multiple source clusters. We typically recommend that each cluster be given a distinct prefix or suffix (as in the example above).
- Type: string
- Default: ${topic}
- Importance: high
topic.auto.create
Whether to automatically create topics in the destination cluster if required.
- Type: boolean
- Default: true
- Importance: low
topic.preserve.partitions
Whether to automatically increase the number of partitions in the destination cluster to match the source cluster and ensure that messages replicated from the source cluster use the same partition in the destination cluster.
- Type: boolean
- Default: true
- Importance: low
topic.create.backoff.ms
Time to wait before retrying auto topic creation or expansion.
- Type: int
- Default: 120000
- Valid Values: [0,...]
- Importance: low
topic.config.sync
Whether to periodically sync topic configuration to the destination cluster.
- Type: boolean
- Default: true
- Importance: low
topic.config.sync.interval.ms
How often to check for configuration changes when topic.config.sync is enabled.
- Type: int
- Default: 120000
- Valid Values: [0,...]
- Importance: low
topic.timestamp.type
The timestamp type for the topics in the destination cluster.
- Type: string
- Default: CreateTime
- Valid Values: [CreateTime, LogAppendTime]
- Importance: low
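A sketch of the destination topic options follows. The dc1_ prefix is a hypothetical name for the source cluster. With this format, a source topic named orders would be written to dc1_orders in the destination cluster.

```properties
# Prefix every replicated topic with a hypothetical source-cluster name
# to avoid collisions when replicating from several clusters.
topic.rename.format=dc1_${topic}
# Create missing topics and keep partition counts aligned with the source.
topic.auto.create=true
topic.preserve.partitions=true
# Sync topic configuration every 60 seconds instead of the default 120000 ms.
topic.config.sync=true
topic.config.sync.interval.ms=60000
```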
Destination Zookeeper¶
dest.zookeeper.connect
Zookeeper connection string for the destination cluster.
- Type: string
- Importance: high
dest.zookeeper.connection.timeout.ms
Connection timeout in milliseconds for the destination Zookeeper cluster.
- Type: int
- Default: 6000
- Valid Values: [0,...]
- Importance: low
dest.zookeeper.session.timeout.ms
Session timeout in milliseconds for the destination Zookeeper cluster.
- Type: int
- Default: 6000
- Valid Values: [0,...]
- Importance: low
Destination Data Conversion¶
key.converter
Converter for the key field of messages written to the destination cluster. Use io.confluent.connect.replicator.util.ByteArrayConverter if you don’t need data conversion.
- Type: class
- Default: inherited from worker configuration
- Importance: low
value.converter
Converter for the value field of messages written to the destination cluster. Use io.confluent.connect.replicator.util.ByteArrayConverter if you don’t need data conversion.
- Type: class
- Default: inherited from worker configuration
- Importance: low
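When no data conversion is needed, the two destination converters can be set to the byte array converter named above, as in this minimal fragment. Setting them explicitly overrides whatever the worker configuration would otherwise supply.

```properties
# Pass keys and values through as raw bytes, with no conversion.
key.converter=io.confluent.connect.replicator.util.ByteArrayConverter
value.converter=io.confluent.connect.replicator.util.ByteArrayConverter
```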