Question Details

No question body available.

Tags

python tensorflow keras fine-tuning

Answers (1)

Accepted Answer
November 2, 2025 · Score: 0 · Rep: 1,726

A first approach ensures gradients always flow through the LoRA matrices and uses a `training` argument to control behavior at runtime:

def call(self, inputs, training=None):
    # Compute the low-rank update: deltaW = B @ A, reshaped to the kernel shape
    delta_w = tf.matmul(self.lora_B, self.lora_A)
    delta_w = tf.reshape(delta_w, self.kernel_shape)

    # Apply scaling based on training mode
    if training and self.lora_enabled and not self.merged:
        effective_delta = self.scaling * delta_w
    else:
        effective_delta = tf.zeros_like(delta_w)

    outputs = tf.nn.conv2d(
        inputs,
        self.original_layer.kernel + effective_delta,
        strides=self.original_layer.strides,
        padding=self.original_layer.padding.upper(),
    )

    if self.original_layer.use_bias:
        outputs = tf.nn.bias_add(outputs, self.original_layer.bias)

    if self.original_layer.activation is not None:
        outputs = self.original_layer.activation(outputs)

    return outputs
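A minimal standalone sketch of that gating logic (hypothetical shapes and names, reduced to a plain matmul instead of a conv for brevity): during training the scaled low-rank update `B @ A` is added to the frozen kernel, so gradients reach both LoRA matrices; at inference the delta is zeroed and the layer behaves exactly like the base layer.

```python
import tensorflow as tf

rank, fan_in, fan_out = 2, 4, 3
kernel = tf.random.normal([fan_in, fan_out])      # frozen base kernel
lora_A = tf.Variable(tf.random.normal([rank, fan_out]))
lora_B = tf.Variable(tf.zeros([fan_in, rank]))    # zero-init: delta starts at 0
scaling = 0.5

def forward(x, training):
    # Low-rank update, gated by the Python-level training flag
    delta_w = tf.matmul(lora_B, lora_A)           # [fan_in, fan_out]
    effective = scaling * delta_w if training else tf.zeros_like(delta_w)
    return tf.matmul(x, kernel + effective)

x = tf.random.normal([1, fan_in])
with tf.GradientTape() as tape:
    loss = tf.reduce_sum(forward(x, training=True))
grads = tape.gradient(loss, [lora_A, lora_B])
# Both LoRA matrices are on the gradient path
print(all(g is not None for g in grads))
# With B zero-initialized, inference output matches the base layer exactly
print(bool(tf.reduce_all(forward(x, training=False) == tf.matmul(x, kernel))))
```

Because `lora_B` starts at zero, the initial delta is zero and training begins from the pretrained behavior, which is the standard LoRA initialization.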

Another approach is to keep the LoRA matrices always trainable, but mask their contribution:

def build(self, input_shape):
    # ... existing code ...

    # Always trainable; control the contribution via scaling
    self.lora_A = self.add_weight(
        name="LoRA_mat_A",
        shape=lora_A_shape,
        initializer=keras.initializers.HeUniform(),
        trainable=True,  # Always True
    )
    self.lora_B = self.add_weight(
        name="LoRA_mat_B",
        shape=lora_B_shape,
        initializer=keras.initializers.Zeros(),
        trainable=True,  # Always True
    )

    # Control via this flag
    self.lora_active = tf.Variable(
        initial_value=self.lora_enabled,
        trainable=False,
        dtype=tf.bool,
    )

def call(self, inputs):
    delta_w = tf.matmul(self.lora_B, self.lora_A)
    delta_w = tf.reshape(delta_w, self.kernel_shape)

    # Use tf.cond for runtime switching
    effective_scaling = tf.cond(
        self.lora_active,
        lambda: self.scaling,
        lambda: 0.0,
    )

    # ...
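A small sketch of why the `tf.Variable` flag plus `tf.cond` matters (assumed names): because the predicate is a variable, flipping it changes the behavior of an already-traced `tf.function` without triggering a retrace, unlike a plain Python boolean, which would be baked into the graph.

```python
import tensorflow as tf

# Non-trainable bool variable acts as a runtime on/off switch
lora_active = tf.Variable(True, trainable=False, dtype=tf.bool)
scaling = 2.0

@tf.function
def effective_scaling():
    # tf.cond evaluates the flag at graph execution time, each call
    return tf.cond(lora_active, lambda: scaling, lambda: 0.0)

print(effective_scaling().numpy())  # 2.0 while LoRA is active
lora_active.assign(False)
print(effective_scaling().numpy())  # 0.0 after disabling, no retrace needed
```

This is what makes merge/unmerge or enable/disable toggles cheap at inference time: you assign to the variable instead of rebuilding or recompiling the model.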