I'm encountering an issue where a standalone query runs in about 2 minutes and returns 87 records, yet when I embed it in a batch processing script, the script seems to hang in an infinite loop. I've let the script run for over 2 hours with no progress.
The query is part of a loop that's supposed to insert rows in batches. Typically, towards the end of the job there are very few rows left to process, so I wouldn't expect such a drastic slowdown.
For context, my tables have huge volumes of data:
Source table: ~600 million rows
Existing table: ~900 million rows
Here is a simplified version of my script with masked table and column names:
IF OBJECT_ID('Temp.TempArchive', 'U') IS NOT NULL
BEGIN
    DROP TABLE Temp.TempArchive;
    RAISERROR('Existing table Temp.TempArchive dropped.', 0, 1) WITH NOWAIT;
END;

CREATE TABLE Temp.TempArchive
(
    RecordID BIGINT NOT NULL PRIMARY KEY
);
RAISERROR('Table Temp.TempArchive created.', 0, 1) WITH NOWAIT;

CREATE INDEX IDX_TempArchive_RecordID
    ON Temp.TempArchive (RecordID);
RAISERROR('Index IDX_TempArchive_RecordID created on RecordID.', 0, 1) WITH NOWAIT;

DECLARE @BatchSize INT = 100, @RowsAffected INT = 1;
DECLARE @BatchStartTime DATETIME, @BatchEndTime DATETIME, @BatchTime INT;
DECLARE @OverallStartTime DATETIME = GETDATE(), @OverallEndTime DATETIME, @OverallTime INT;
DECLARE @IterationNumber INT = 1;
DECLARE @LastID BIGINT = 1626851290;

WHILE (@RowsAffected > 0)
BEGIN
    SET @BatchStartTime = GETDATE();
    RAISERROR('STARTED', 0, 1) WITH NOWAIT;

    DECLARE @InsertedIDs TABLE (RecordID BIGINT);

    INSERT INTO Temp.TempArchive (RecordID)
    OUTPUT inserted.RecordID INTO @InsertedIDs
    SELECT DISTINCT TOP (@BatchSize) src.RecordID
    FROM SourceTable src WITH (NOLOCK)
    WHERE NOT EXISTS (SELECT 1
                      FROM ExistingTable chk WITH (NOLOCK)
                      WHERE chk.RecordID = src.RecordID)
      AND src.RecordID IS NOT NULL
      AND src.RecordID > @LastID
    ORDER BY src.RecordID
    OPTION (RECOMPILE);

    SET @RowsAffected = @@ROWCOUNT;

    IF EXISTS (SELECT 1 FROM @InsertedIDs)
    BEGIN
        SELECT @LastID = MAX(RecordID) FROM @InsertedIDs;
    END;

    SET @BatchEndTime = GETDATE();
    SET @BatchTime = DATEDIFF(SECOND, @BatchStartTime, @BatchEndTime);

    RAISERROR('LastID %I64d', 0, 1, @LastID) WITH NOWAIT;
    RAISERROR('Inserted %d rows in this batch. Batch time: %d seconds.', 0, 1, @RowsAffected, @BatchTime) WITH NOWAIT;
    RAISERROR('Iteration Number %d', 0, 1, @IterationNumber) WITH NOWAIT;
    SET @IterationNumber = @IterationNumber + 1;
    RAISERROR('====================================================================================', 0, 1) WITH NOWAIT;
END;

SET @OverallEndTime = GETDATE();
SET @OverallTime = DATEDIFF(SECOND, @OverallStartTime, @OverallEndTime);
RAISERROR('Data inserted successfully. Total time: %d seconds.', 0, 1, @OverallTime) WITH NOWAIT;
And here's the standalone version of the query:
SELECT DISTINCT TOP 10000 src.RecordID
FROM SourceTable src WITH (NOLOCK)
WHERE NOT EXISTS (SELECT 1
                  FROM ExistingTable chk WITH (NOLOCK)
                  WHERE chk.RecordID = src.RecordID)
  AND src.RecordID IS NOT NULL
  AND src.RecordID > 1626851290
ORDER BY src.RecordID;
The standalone query executes in about 2 minutes and returns 87 rows, but when executed within the loop, the process never completes.
Any suggestions on why the query might run fine on its own but get stuck in the batch loop?
Could this be related to query plan caching, locking, or something else inherent to the batch process? I've fixed similar issues before by addressing parameter sniffing, but this is ad-hoc SQL, not a stored procedure.
I tried two approaches:
I ran the query by itself. It completed in about 2 minutes and returned 87 rows.
I embedded the same query in a loop that processes rows in batches (using a TOP clause and updating a marker variable). I expected the loop to process those 87 records and then exit once there were no more rows to insert.
What I was expecting was that the batch loop would terminate as soon as there were no more qualifying rows left (i.e. when @@ROWCOUNT is 0). Instead, the script appears to get stuck, running indefinitely (I even let it run for 2 hours and 30 minutes) without finishing, even though the standalone query shows that very few rows remain to be processed at the end.
1 Answer
Honestly, you are probably better off just removing the loop and doing it all at once.
INSERT INTO Temp.TempArchive (RecordID)
SELECT DISTINCT
    src.RecordID
FROM SourceTable src
WHERE NOT EXISTS (SELECT 1
                  FROM ExistingTable chk
                  WHERE chk.RecordID = src.RecordID)
  AND src.RecordID > 1626851290
-- why ORDER BY src.RecordID? It's not necessary for an INSERT and makes little sense
OPTION (RECOMPILE);
What you definitely need on tables this large are indexes on the join column:

SourceTable (RecordID)
ExistingTable (RecordID)
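For example, as plain nonclustered indexes (a minimal sketch; the index names and the dbo schema are my assumptions, not from the original post):

-- Hypothetical names; adjust to your naming convention.
CREATE INDEX IX_SourceTable_RecordID   ON dbo.SourceTable (RecordID);
CREATE INDEX IX_ExistingTable_RecordID ON dbo.ExistingTable (RecordID);

Building these on 600M/900M-row tables takes a while, but they turn the NOT EXISTS check into an index seek instead of a scan.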
If either ExistingTable or SourceTable actually has only a few distinct values of RecordID, then an aggregated indexed view might be more beneficial.
CREATE VIEW dbo.vSourceTable_RecordIDs
WITH SCHEMABINDING
AS
SELECT
    s.RecordID,
    COUNT_BIG(*) AS Count  -- necessary for an indexed view with GROUP BY
FROM dbo.SourceTable s     -- or ExistingTable
GROUP BY
    s.RecordID;

CREATE UNIQUE CLUSTERED INDEX CX ON dbo.vSourceTable_RecordIDs (RecordID);
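One caveat: outside Enterprise edition the optimizer won't match the indexed view automatically, so you'd reference it directly with the NOEXPAND hint. A sketch reusing the view above (DISTINCT is no longer needed, since the view already groups by RecordID):

INSERT INTO Temp.TempArchive (RecordID)
SELECT v.RecordID
FROM dbo.vSourceTable_RecordIDs v WITH (NOEXPAND)  -- force use of the view's index
WHERE NOT EXISTS (SELECT 1
                  FROM ExistingTable chk
                  WHERE chk.RecordID = v.RecordID)
  AND v.RecordID > 1626851290;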
- You can drop the index IDX_TempArchive_RecordID on Temp.TempArchive; it's completely superfluous, since the PRIMARY KEY on RecordID already creates an index.
- Don't use NOLOCK unless you really know what you're doing; it has serious data integrity implications.
Doesn't INSERT INTO ... SELECT insert all the rows? Can rows be inserted into SourceTable while the INSERT is running? In that case, this will loop and loop until there is a moment when no rows are being added to SourceTable, which might seem like forever. – GuidoG Commented Mar 25 at 6:41

The @InsertedIDs table will get pretty large; maybe that's what's taking the time when you fetch the MAX value. I usually use two tables: one for the inserted OUTPUT and one final table. – siggemannen Commented Mar 25 at 7:49
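For reference, a minimal sketch of siggemannen's point: a table variable declared inside a WHILE loop is not re-created on each iteration (the DECLARE does not reset it), so its rows accumulate across batches. Clearing it at the top of each iteration keeps it at @BatchSize rows. This keeps the original script's shape but drops NOLOCK, per the answer above:

DECLARE @BatchSize INT = 100, @RowsAffected INT = 1;
DECLARE @LastID BIGINT = 1626851290;
DECLARE @InsertedIDs TABLE (RecordID BIGINT);

WHILE (@RowsAffected > 0)
BEGIN
    -- Table variables live for the whole batch; rows from earlier
    -- iterations persist, so empty it before reusing it as the OUTPUT target.
    DELETE FROM @InsertedIDs;

    INSERT INTO Temp.TempArchive (RecordID)
    OUTPUT inserted.RecordID INTO @InsertedIDs
    SELECT DISTINCT TOP (@BatchSize) src.RecordID
    FROM SourceTable src
    WHERE NOT EXISTS (SELECT 1
                      FROM ExistingTable chk
                      WHERE chk.RecordID = src.RecordID)
      AND src.RecordID > @LastID
    ORDER BY src.RecordID
    OPTION (RECOMPILE);

    SET @RowsAffected = @@ROWCOUNT;

    IF @RowsAffected > 0
        SELECT @LastID = MAX(RecordID) FROM @InsertedIDs;
END;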