author    Kasper Timm Hansen <kaspth@gmail.com>  2019-03-09 01:15:17 +0100
committer Kasper Timm Hansen <kaspth@gmail.com>  2019-03-20 13:28:18 +0100
commit    2ea5c24085e39efb6c1bc6e7062bcc5f496d1fa6 (patch)
tree      9a6448dca6dcd973957584686edf60961189ae01 /activerecord/lib/active_record/persistence.rb
parent    c11d115fa3c58ec6bde5e9799673cee381d22d47 (diff)
Bulk Insert: Reuse indexes for unique_by
I found `:unique_by` with `:columns` and `:where` inside it tough to grasp. The documentation only mentioned indexes and partial indexes. So why duplicate a model's indexes in an insert_all/upsert_all call when we can just look them up? This has the added benefit of raising if no index is found, so people can't insert thousands of records without relying on an index of some form.
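The lookup the commit message describes can be sketched in plain Ruby. The `Index` struct and `find_unique_index` helper below are hypothetical stand-ins for illustration, not the actual `InsertAll` implementation: resolve an index from the model's own schema information, and raise when nothing matches rather than silently proceeding without one.

```ruby
# Hypothetical sketch: reuse a model's declared indexes instead of making
# callers restate an index's columns and WHERE clause at each call site.
Index = Struct.new(:name, :columns, :unique, keyword_init: true)

def find_unique_index(indexes, name)
  index = indexes.find { |i| i.unique && i.name == name.to_s }
  # Raising here is the safety net the commit message mentions: no silent
  # bulk insert without a unique index backing it.
  raise ArgumentError, "no unique index found for #{name.inspect}" unless index
  index
end

INDEXES = [
  Index.new(name: "index_books_on_isbn", columns: %w[isbn], unique: true),
  Index.new(name: "index_books_on_slug", columns: %w[slug], unique: false)
]

find_unique_index(INDEXES, :index_books_on_isbn).columns # => ["isbn"]
```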
Diffstat (limited to 'activerecord/lib/active_record/persistence.rb')
-rw-r--r--  activerecord/lib/active_record/persistence.rb | 65
1 file changed, 24 insertions(+), 41 deletions(-)
diff --git a/activerecord/lib/active_record/persistence.rb b/activerecord/lib/active_record/persistence.rb
index 0c31f0f57e..8b7c9f59fa 100644
--- a/activerecord/lib/active_record/persistence.rb
+++ b/activerecord/lib/active_record/persistence.rb
@@ -92,25 +92,21 @@ module ActiveRecord
# <tt>returning: %w[ id name ]</tt> to return the id and name of every successfully inserted
# record or pass <tt>returning: false</tt> to omit the clause.
#
- # [:unique_by]
- # (Postgres and SQLite only) In a table with more than one unique constraint or index,
+ # [:unique_index]
+ # (Postgres and SQLite only) In a table with more than one unique constraint or index,
# new records may be considered duplicates according to different criteria. By default,
- # new rows will be skipped if they violate _any_ unique constraint or index. By defining
- # <tt>:unique_by</tt>, you can skip rows that would create duplicates according to the given
+ # new rows will be skipped if they violate _any_ unique constraint/index. By passing
+ # <tt>:unique_index</tt>, you can skip rows that would create duplicates according to the given
# constraint but raise <tt>ActiveRecord::RecordNotUnique</tt> if rows violate other constraints.
#
- # (For example, maybe you assume a client will try to import the same ISBNs more than
+ # For example, maybe you assume a client will try to import the same ISBNs more than
# once and want to silently ignore the duplicate records, but you don't expect any of
# your code to attempt to create two rows with the same primary key and would appreciate
- # an exception report in that scenario.)
+ # an exception instead.
#
- # Indexes can be identified by an array of columns:
+ # Indexes can be identified by name:
#
- # unique_by: { columns: %w[ isbn ] }
- #
- # Partial indexes can be identified by an array of columns and a <tt>:where</tt> condition:
- #
- # unique_by: { columns: %w[ isbn ], where: "published_on IS NOT NULL" }
+ # unique_index: :isbn
#
# ==== Example
#
@@ -120,7 +116,6 @@ module ActiveRecord
# { id: 1, title: 'Rework', author: 'David' },
# { id: 1, title: 'Eloquent Ruby', author: 'Russ' }
# ])
- #
def insert_all(attributes, returning: nil, unique_by: nil)
InsertAll.new(self, attributes, on_duplicate: :skip, returning: returning, unique_by: unique_by).execute
end
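The skip-on-duplicate behavior insert_all documents above can be sketched without a database. This is an illustrative stand-in, not the generated SQL: `insert_all_skip_sketch` and its `key:` parameter are hypothetical names, and real duplicate detection happens in the database via the unique index.

```ruby
# Hypothetical sketch of insert_all's skip semantics: rows whose unique key
# already exists (in the table or earlier in the batch) are dropped silently.
def insert_all_skip_sketch(existing, rows, key:)
  seen = existing.map { |row| row[key] }
  rows.select do |row|
    next false if seen.include?(row[key]) # duplicate: skip, don't raise
    seen << row[key]
    true
  end
end

existing = [{ id: 1, title: "Rework" }]
insert_all_skip_sketch(existing,
  [{ id: 1, title: "Eloquent Ruby" }, { id: 2, title: "Clean Code" }],
  key: :id)
# => [{ id: 2, title: "Clean Code" }]
```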
@@ -174,12 +169,11 @@ module ActiveRecord
# ])
#
# # Raises ActiveRecord::RecordNotUnique because 'Eloquent Ruby'
- # # does not have a unique ID
+ # # does not have a unique id.
# Book.insert_all!([
# { id: 1, title: 'Rework', author: 'David' },
# { id: 1, title: 'Eloquent Ruby', author: 'Russ' }
# ])
- #
def insert_all!(attributes, returning: nil)
InsertAll.new(self, attributes, on_duplicate: :raise, returning: returning).execute
end
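By contrast, insert_all! raises on any duplicate instead of skipping. Again a plain-Ruby sketch under assumed names (`RecordNotUniqueSketch` stands in for ActiveRecord::RecordNotUnique; the real check is the database's unique constraint):

```ruby
# Hypothetical sketch of insert_all!'s raise semantics: the first duplicate
# key aborts the whole batch, mirroring the RecordNotUnique example above.
class RecordNotUniqueSketch < StandardError; end

def insert_all_raise_sketch(existing, rows, key:)
  seen = existing.map { |row| row[key] }
  rows.each do |row|
    if seen.include?(row[key])
      raise RecordNotUniqueSketch, "duplicate #{key}: #{row[key].inspect}"
    end
    seen << row[key]
  end
  existing + rows
end
```

Feeding it the doc's example batch — two rows with `id: 1` — raises on the second row, just as the comment describes.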
@@ -217,43 +211,32 @@ module ActiveRecord
# <tt>returning: %w[ id name ]</tt> to return the id and name of every successfully inserted
# record or pass <tt>returning: false</tt> to omit the clause.
#
- # [:unique_by]
- # (Postgres and SQLite only) In a table with more than one unique constraint or index,
+ # [:unique_index]
+ # (Postgres and SQLite only) In a table with more than one unique constraint or index,
# new records may be considered duplicates according to different criteria. For MySQL,
# an upsert will take place if a new record violates _any_ unique constraint. For
# Postgres and SQLite, new rows will replace existing rows when the new row has the
- # same primary key as the existing row. In case of SQLite, an upsert operation causes
- # an insert to behave as an update or a no-op if the insert would violate
- # a uniqueness constraint. By defining <tt>:unique_by</tt>, you can supply
- # a different unique constraint for matching new records to existing ones than the
- # primary key.
- #
- # (For example, if you have a unique index on the ISBN column and use that as
- # the <tt>:unique_by</tt>, a new record with the same ISBN as an existing record
- # will replace the existing record but a new record with the same primary key
- # as an existing record will raise <tt>ActiveRecord::RecordNotUnique</tt>.)
- #
- # Indexes can be identified by an array of columns:
+ # same primary key as the existing row. By passing <tt>:unique_index</tt>, you can supply
+ # an index other than the primary key to match new records to existing ones.
#
- # unique_by: { columns: %w[ isbn ] }
+ # For example, if you have a unique index on the ISBN column and pass that as
+ # <tt>:unique_index</tt>, a new record with the same ISBN as an existing record
+ # will replace the existing record but a new record with the same primary key
+ # as an existing record will raise <tt>ActiveRecord::RecordNotUnique</tt>.
#
- # Partial indexes can be identified by an array of columns and a <tt>:where</tt> condition:
+ # Indexes can be identified by name:
#
- # unique_by: { columns: %w[ isbn ], where: "published_on IS NOT NULL" }
+ # unique_index: :isbn
#
# ==== Examples
#
- # # Given a unique index on <tt>books.isbn</tt> and the following record:
- # Book.create!(title: 'Rework', author: 'David', isbn: '1')
+ # # Insert multiple records, performing an upsert when records have duplicate ISBNs.
+ # # Here 'Eloquent Ruby' will overwrite 'Rework' because its ISBN is a duplicate.
#
- # # Insert multiple records, allowing new records with the same ISBN
- # # as an existing record to overwrite the existing record.
- # # ('Eloquent Ruby' will overwrite 'Rework' because its ISBN is duplicate)
# Book.upsert_all([
- # { title: 'Eloquent Ruby', author: 'Russ', isbn: '1' },
- # { title: 'Clean Code', author: 'Robert', isbn: '2' }
- # ], unique_by: { columns: %w[ isbn ] })
- #
+ # { title: 'Rework', author: 'David', isbn: '1' },
+ # { title: 'Eloquent Ruby', author: 'Russ', isbn: '1' }
+ # ], unique_index: :isbn)
def upsert_all(attributes, returning: nil, unique_by: nil)
InsertAll.new(self, attributes, on_duplicate: :update, returning: returning, unique_by: unique_by).execute
end
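The upsert semantics documented above — new rows replacing existing ones that match on the chosen unique key — can likewise be sketched in plain Ruby. `upsert_all_sketch` is a hypothetical helper; the real work is done by the database's ON CONFLICT / upsert machinery, not in Ruby.

```ruby
# Hypothetical sketch of upsert_all: index existing rows by the unique key,
# then let incoming rows overwrite matches. Later rows in the batch win.
def upsert_all_sketch(existing, rows, key:)
  by_key = existing.each_with_object({}) { |row, acc| acc[row[key]] = row }
  rows.each { |row| by_key[row[key]] = row }
  by_key.values
end

upsert_all_sketch([], [
  { title: "Rework", author: "David", isbn: "1" },
  { title: "Eloquent Ruby", author: "Russ", isbn: "1" }
], key: :isbn)
# => [{ title: "Eloquent Ruby", author: "Russ", isbn: "1" }]
```

Run against the doc's example batch, the two rows share ISBN '1', so only 'Eloquent Ruby' survives — the overwrite behavior the comment describes.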