English 中文(简体)
Efficient Method to Check for Referenced DefinedNames in a Workbook using OpenXML or ClosedXML
原标题:

I am working on a problem where I need to ascertain whether a specific DefinedName in a workbook is referenced within a particular worksheet. I am currently using the OpenXML library to facilitate this.

Here is my current approach:

Firstly, I construct a HashSet containing all calculations within the worksheet. This is achieved through the following method:

private HashSet<string> BuildAllCalculationsSet(SpreadsheetDocument document)
{
    HashSet<string> allCalculations = new HashSet<string>();

    // Iterate over all worksheets
    foreach (var worksheetPart in document.WorkbookPart.WorksheetParts)
    {
        var cellsWithFormulas = worksheetPart.Worksheet.Descendants<Cell>().Where(c => c.CellFormula != null);

        // Iterate over all cells with formulas
        foreach (var cell in cellsWithFormulas)
        {
            // Add the cell s formula to the set of all calculations
            allCalculations.Add(cell.CellFormula.Text);
        }
    }

    return allCalculations;
}

Subsequently, I employ this HashSet to determine if a specific DefinedName is referenced within the worksheet:

private bool IsNamedRangeReferenced(HashSet<string> allCalculations, string namedRange)
{
    // Check if the named range is referenced in any calculation
    return allCalculations.Any(calculation => calculation.Contains(namedRange));
}

This method is utilized as follows:

if (IsNamedRangeReferenced(allCalculations, namedRange.Name.Value))
{
    isReferenced = true;
    break;
}

Now, I am contemplating incorporating recursion and Graphs with Depth First Search algo to also account for scenarios where a DefinedName might be referenced by another DefinedName which in turn is referenced in the worksheet s calculations. However, before proceeding with this approach, I am curious to explore if there might be a more efficient solution.

Given that some of the documents can contain more than 30,000 DefinedNames, it is crucial to strike a balance between performance and reliability. I am open to solutions using either OpenXML, which I m currently employing, or ClosedXML.

Any advice or guidance would be greatly appreciated!

Thank you in advance.

问题回答

暂无回答




相关问题
Anyone feel like passing it forward?

I m the only developer in my company, and am getting along well as an autodidact, but I know I m missing out on the education one gets from working with and having code reviewed by more senior devs. ...

NSArray s, Primitive types and Boxing Oh My!

I m pretty new to the Objective-C world and I have a long history with .net/C# so naturally I m inclined to use my C# wits. Now here s the question: I feel really inclined to create some type of ...

C# Marshal / Pinvoke CBitmap?

I cannot figure out how to marshal a C++ CBitmap to a C# Bitmap or Image class. My import looks like this: [DllImport(@"test.dll", CharSet = CharSet.Unicode)] public static extern IntPtr ...

How to Use Ghostscript DLL to convert PDF to PDF/A

How to user GhostScript DLL to convert PDF to PDF/A. I know I kind of have to call the exported function of gsdll32.dll whose name is gsapi_init_with_args, but how do i pass the right arguments? BTW, ...

Linqy no matchy

Maybe it s something I m doing wrong. I m just learning Linq because I m bored. And so far so good. I made a little program and it basically just outputs all matches (foreach) into a label control. ...

热门标签